Abstract

We introduce a generalization of relative entropy derived from the Wigner-Yanase-Dyson
entropy and give a simple, self-contained proof that it is convex.
Moreover, special cases yield the joint convexity of relative entropy, and for
$\mathrm{Tr}\, K^* A^p K B^{1-p}$ Lieb's joint concavity in $(A, B)$ for $0 < p < 1$ and Ando's
joint convexity for $1 < p \le 2$. This approach allows us to obtain conditions
for equality in these cases, as well as conditions for equality in a number of
inequalities which follow from them. These include the monotonicity under
partial traces, and some Minkowski type matrix inequalities proved by Lieb
and Carlen for $\mathrm{Tr}_1(\mathrm{Tr}_2 A_{12}^p)^{1/p}$. In all cases the equality conditions are
independent of $p$; for extensions to three spaces they are identical to the conditions
for equality in the strong subadditivity of relative entropy.
1 Introduction



1.1 Background 

For matrices $A_{12} > 0$ acting on a tensor product of two Hilbert spaces, Carlen
and Lieb [7, 8] considered the trace function $[\mathrm{Tr}_1(\mathrm{Tr}_2 A_{12}^p)^{q/p}]^{1/q}$ and proved that
it is concave when $0 < p \le q \le 1$ and convex when $1 \le q$ and $1 \le p \le 2$.
They showed that this implies that these functions and the norms they generate
satisfy Minkowski type inequalities, including a natural generalization to matrices
$A_{123}$ acting on a tensor product of three Hilbert spaces. They also raised the question
of the conditions for equality in their inequalities. When $q = 1$, we show that this
can be treated using methods developed to treat equality in the strong subadditivity
of quantum entropy. Moreover, we obtain conditions for equality in a large class
of related convexity inequalities, show that they are independent of $p$ in the range
$0 < p < 2$, and show that for inequalities involving $A_{123}$ they are identical to the
equality conditions for strong subadditivity (SSA) of quantum entropy given in [T3].

These equality conditions are non-trivial and have found many applications in
quantum information theory. For example, they play an important role in some
recent "no broadcasting" results; see [IB] and references therein. They also play a
key role in Devetak and Yard's [9] "quantum state redistribution" protocol, which
gives an operational interpretation to the quantum conditional mutual information.

Our approach to proving joint convexity of relative entropy is motivated by
Araki's relative modular operator [5], introduced to generalize relative entropy to
more general situations including type III von Neumann algebras. It was subsequently
used by Narnhofer and Thirring [28] to give a new proof of SSA. The argument
given here is similar to that in [T7J, 30, 36]; however, the unified treatment for
$0 < p < 2$, leading to equality conditions, is new. Moreover, a dual treatment can be
given for $-1 < p < 1$, allowing extension to the full range $(-1, 2)$.

Wigner and Yanase [HJ HZ| introduced the notion of skew information of a density
matrix $\gamma$ with respect to a self-adjoint observable $K$,

$$-\tfrac{1}{2}\,\mathrm{Tr}\,[K, \gamma^{p}][K, \gamma^{1-p}] \tag{1}$$

for $p = \tfrac12$, and Dyson suggested extending this to $p \in (0, 1)$. Wigner and Yanase [12]
proved that (1) is convex in $\gamma$ for $p = \tfrac12$ and, in his seminal paper [19] on convex trace
functions, Lieb proved joint concavity for $p \in (0, 1)$ for the more general function

$$(A, B) \mapsto \mathrm{Tr}\, K^* A^{p} K B^{1-p} \tag{2}$$

for $K$ fixed and $A, B \ge 0$ positive semi-definite. This implies convexity of (1) and
was a key step in the original proof [22] of the strong subadditivity (SSA) inequality
of quantum entropy. Moreover, it leads to a proof of joint convexity of relative
entropy¹ as well. It is less well known that Ando [H H] gave another proof which
also showed that for $1 < p < 2$, the function (2) is jointly convex in $A, B$. The case
$p = 2$ was considered earlier by Lieb and Ruskai [23]. We modify what one might
describe as Lieb's extension of the Wigner-Yanase-Dyson (WYD) entropy to a type
of relative entropy in a way that allows a unified treatment of the convexity and
concavity of $\mathrm{Tr}\, K^* A^{p} K B^{1-p}$ in the range $p \in (0, 2]$ and includes the usual relative
entropy as a special case. Our modification retains a linear term, even for $A \ne B$.
Although this might seem unnecessary for convexity and concavity questions, it is
crucial to a unified treatment.

Lieb also considered $\mathrm{Tr}\, K^* A^{p} K B^{q}$ with $p, q > 0$ and $0 < p + q \le 1$, and Ando
considered $1 < q \le p \le 2$. In Section 2.2 we extend our results to this situation.
However, we also show that for $q \ne 1 - p$, equality holds only under trivial conditions.
Therefore, we concentrate on the case $q = 1 - p$.

Next, we introduce our notation and conventions. In Section 2, we first describe
our generalization of relative entropy and prove its convexity; then consider the
extension to $q \ne 1 - p$ mentioned above; and finally prove monotonicity under partial
traces, including a generalization of strong subadditivity to $p \ne 1$. In Section 3 we
consider several formulations of equality conditions. In Section 4 we show how
to use these results to obtain equality conditions in the results of Lieb and Carlen
[8]. For completeness, we include an appendix which contains the proof of a basic
convexity result from [36] that is key to our results.



1.2 Notation and conventions 

We introduce two linear maps on the space of $d \times d$ matrices. Left multiplication
by $A$ is denoted $L_A$ and defined as $L_A(X) = AX$; right multiplication by $B$ is
denoted $R_B$ and defined as $R_B(X) = XB$. These maps are associated with the
relative modular operator $\Delta_{AB} = L_A R_B^{-1}$ introduced by Araki in a far more general
context. They have the following properties:

a) The operators $L_A$ and $R_B$ commute since

$$L_A[R_B(X)] = AXB = R_B[L_A(X)] \tag{3}$$

even when $A$ and $B$ do not commute.

¹ In [22] only concavity of the conditional entropy was proved explicitly, but the same
argument [35, Section V.B] yields joint convexity of the relative entropy. Independently, Lindblad [25]
observed that this follows directly from (2) by differentiating at $p = 1$.
b) $L_A$ and $R_A$ are invertible if and only if $A$ is non-singular, in which case
$L_A^{-1} = L_{A^{-1}}$ and $R_A^{-1} = R_{A^{-1}}$.

c) When $A$ is self-adjoint, $L_A$ and $R_A$ are both self-adjoint with respect to the
Hilbert-Schmidt inner product $\langle A, B \rangle = \mathrm{Tr}\, A^* B$.

d) When $A \ge 0$, the operators $L_A$ and $R_A$ are positive semi-definite, i.e.,

$$\mathrm{Tr}\, X^* L_A(X) = \mathrm{Tr}\, X^* A X \ge 0 \quad\text{and}\quad \mathrm{Tr}\, X^* R_A(X) = \mathrm{Tr}\, X^* X A = \mathrm{Tr}\, X A X^* \ge 0.$$

e) When $A \ge 0$, then $(L_A)^p = L_{A^p}$ and $(R_A)^p = R_{A^p}$ for all $p \ge 0$. If $A$ is also
non-singular, this extends to all $p \in \mathbb{R}$. More generally $f(L_A) = L_{f(A)}$ for
$f : (0, \infty) \to \mathbb{R}$.

To see why (e) holds, it suffices to observe that $A \ge 0$ implies $L_A$ and $R_A$ are linear
operators for which $f(L_A)$ and $f(R_A)$ can be defined by the spectral theorem for any function $f$
with domain in $(0, \infty)$. It is easy to verify that $A|\phi_j\rangle = \alpha_j|\phi_j\rangle$ implies
$L_A |\phi_j\rangle\langle\phi_k| = \alpha_j |\phi_j\rangle\langle\phi_k|$ for $k = 1 \ldots d$, so that the spectral decomposition of $A$
induces one on $L_A$ with degeneracy $d$ and $f(L_A)|\phi_j\rangle\langle\phi_k| = f(\alpha_j)|\phi_j\rangle\langle\phi_k|$.
For $R_B$ a similar argument
goes through starting with left eigenvectors of $B$, i.e., $\langle\phi_j|B = \beta_j\langle\phi_j|$.
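Properties (a) and (d) are easy to check by hand on small matrices. The following is a minimal sketch, not part of the original argument: the helper names `matmul`, `left`, `right` and `hs_inner`, and the particular integer test matrices, are illustrative assumptions. Integer entries are used so that the comparisons are exact.

```python
# Left and right multiplication on 2x2 real matrices stored as lists of rows.
# All names and test matrices here are illustrative assumptions.

def matmul(P, Q):
    """Product of two square matrices given as lists of rows."""
    n = len(P)
    return [[sum(P[i][k] * Q[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def left(A):
    """Return the map L_A : X -> A X."""
    return lambda X: matmul(A, X)

def right(B):
    """Return the map R_B : X -> X B."""
    return lambda X: matmul(X, B)

def hs_inner(X, Y):
    """Hilbert-Schmidt inner product Tr X^* Y for real matrices."""
    n = len(X)
    return sum(X[i][j] * Y[i][j] for i in range(n) for j in range(n))

A = [[2, 1], [1, 3]]     # positive definite
B = [[1, 2], [2, 5]]     # positive definite, does not commute with A
X = [[1, -4], [7, 2]]    # arbitrary test matrix

LA, RB = left(A), right(B)

commute = LA(RB(X)) == RB(LA(X))            # property (a): both equal A X B
ab_differs = matmul(A, B) != matmul(B, A)   # A and B themselves do not commute
pos_left = hs_inner(X, LA(X))               # Tr X^* A X, property (d)
pos_right = hs_inner(X, right(A)(X))        # Tr X^* X A, property (d)
```

Here `commute` is true although `ab_differs` is also true, which is exactly the point of (a): the maps commute because matrix multiplication is associative, not because $A$ and $B$ commute.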

If a function is homogeneous of degree 1, then convexity is equivalent to subadditivity.
Thus, if $F(\lambda A) = \lambda F(A)$, then $F$ is convex if and only if $F(A) \le \sum_j F(A_j)$
with $A = \sum_j A_j$. We will use this equivalence without further ado.

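We will encounter expressions involving commuting positive semi-definite matrices
$A, D$ with $\ker D \subseteq \ker A$. We will simply write $AD^{-1}$ for

$$\lim_{\epsilon \to 0} \sqrt{A}\,(D + \epsilon I)^{-1} \sqrt{A} = \begin{cases} \big(A|_{(\ker A)^\perp}\big)\big(D|_{(\ker A)^\perp}\big)^{-1} & \text{on } (\ker A)^\perp \\ 0 & \text{on } \ker A \end{cases}$$

For $B$ positive semi-definite, we denote the projection onto $(\ker B)^\perp$ by $P_{(\ker B)^\perp}$.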



2 WYD entropy revisited and extended 

2.1 Generalization of relative entropy 

We now introduce the family of functions

$$g_p(x) = \begin{cases} \dfrac{1}{p(1-p)}\,(x - x^{p}) & p \ne 1 \\ x \log x & p = 1 \end{cases}$$

which are well-defined for $x > 0$ and $p \ne 0$. For our purposes, it would suffice to
consider $p \in [\tfrac12, 2]$. For $A, B$ strictly positive we define

$$J_p(K, A, B) \equiv \mathrm{Tr}\, \sqrt{B}\, K^*\, g_p(L_A R_B^{-1})(K\sqrt{B}) \tag{6}$$

$$= \begin{cases} \dfrac{1}{p(1-p)}\big(\mathrm{Tr}\, K^* A K - \mathrm{Tr}\, K^* A^{p} K B^{1-p}\big) & p \in (0, 1) \cup (1, 2) \\ \mathrm{Tr}\, K K^* A \log A - \mathrm{Tr}\, K^* A K \log B & p = 1 \\ -\tfrac12\big(\mathrm{Tr}\, K^* A K - \mathrm{Tr}\, A K B^{-1} K^* A\big) & p = 2 \end{cases} \tag{7}$$

When $p = 1$ and $K = I$, (7) reduces to the usual relative entropy, i.e.,

$$J_1(I, A, B) = H(A, B) = \mathrm{Tr}\, A (\log A - \log B) \tag{8}$$

For $p \ne 1$, the function $J_p(K, A, B)$ differs from that considered by Lieb [19] and
Ando (21 II] by the seemingly irrelevant linear term $\mathrm{Tr}\, K^* A K$ and the factor $\frac{1}{p(1-p)}$.
However, this minor difference allows us to give a unified treatment of $p \in (0, 2]$
because of the extension by continuity to $p = 1$ and the sign change there.
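The role of the factor $1/(p(1-p))$ can be seen already for scalars. The sketch below is illustrative only (the function name `g` and the sample points are assumptions); it evaluates $g_p$ on either side of $p = 1$, compares with $x \log x$, and spot-checks midpoint convexity for several $p$.

```python
import math

def g(p, x):
    """g_p(x) = (x - x**p) / (p (1 - p)) for p != 1, and x log x at p = 1."""
    if p == 1:
        return x * math.log(x)
    return (x - x ** p) / (p * (1 - p))

x = 2.5
limit_value = g(1, x)          # x log x
below = g(1 - 1e-6, x)         # p slightly below 1
above = g(1 + 1e-6, x)         # p slightly above 1
at_two = g(2, x)               # equals (x**2 - x) / 2, matching the p = 2 line of (7)

# Midpoint convexity on a few sample pairs (u, v) and exponents p.
convex_ok = all(
    g(p, (u + v) / 2) <= (g(p, u) + g(p, v)) / 2 + 1e-12
    for p in (0.3, 0.7, 1, 1.5, 2)
    for (u, v) in ((0.5, 3.0), (1.0, 4.0))
)
```

Without the linear term $x$ and the prefactor, $-x^p$ and $x^p$ would have to be treated separately for $p < 1$ and $p > 1$; with them, one convex family covers both ranges and passes continuously through $x \log x$.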

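One might expect to associate the exchange $A \leftrightarrow B$ with the symmetry $p \leftrightarrow
(1 - p)$ around $p = \tfrac12$. However, this is problematic at $p = 1$. Therefore, we use
instead the observation that

$$J_p(K^*, B, A) = \mathrm{Tr}\, \sqrt{A}\, K\, g_p(L_B R_A^{-1})(K^*\sqrt{A}) = \mathrm{Tr}\, \sqrt{B}\, K^*\, \tilde g_{1-p}(L_A R_B^{-1})(K\sqrt{B}) = \tilde J_{1-p}(K, A, B) \tag{9}$$

where, for $-1 < p < 1$, we define

$$\tilde g_p(x) \equiv x\, g_{1-p}(x^{-1}) = \begin{cases} \dfrac{1}{p(1-p)}\,(1 - x^{p}) & p \ne 0 \\ -\log x & p = 0 \end{cases} \tag{10}$$

and $\tilde J_p(K, A, B) \equiv \mathrm{Tr}\, \sqrt{B}\, K^*\, \tilde g_p(L_A R_B^{-1})(K\sqrt{B})$.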

The functions $J_p(K, A, B)$ and $\tilde J_p(K, A, B)$ have been considered before, usually
with $K = I$, in the context of information geometry. (See [21 Section 7.2] and
references therein.) What is novel here is that we present a simple unified proof of
joint convexity in $A, B$ that easily yields equality conditions, shows that they are
independent of $p$ and can be extended to other functions.

When $K = K^*$, the relation

$$J_p(K, A, A) = -\frac{1}{2p(1-p)}\, \mathrm{Tr}\, [K, A^{p}][K, A^{1-p}] \tag{11}$$

yields the original WYD information (up to a constant) and extends it to the range
$(0, 2]$. Moreover, $K = K^*$ implies that $J_p(K, A, A) = J_{1-p}(K, A, A) = \tilde J_p(K, A, A)$.
Observe that although neither $g_p(w)$ nor $\tilde g_p(w)$ is positive, their average
$G_p(w) = \tfrac12[g_p(w) + w\, g_p(w^{-1})] \ge 0$ on $(0, \infty)$. Therefore, when $K = K^*$,

$$J_p(K, A, A) = \mathrm{Tr}\, (K\sqrt{A})^*\, G_p(L_A R_A^{-1})(K\sqrt{A}) \ge 0 \tag{12}$$

The function $J_p(I, A, B)$ is a more appealing generalization of relative entropy
than $\mathrm{Tr}\, A^{p} B^{1-p}$ because of Proposition 1, which one can consider a generalization
of Klein's inequality [IB]. It allows one to use $J_p(I, A, B)$ as a pseudo-metric, as is
commonly done with the relative entropy.

Proposition 1 When $U$ is unitary and $A, B > 0$ with $\mathrm{Tr}\, A = \mathrm{Tr}\, B = 1$, then
$J_p(U, A, B) \ge 0$ with equality if and only if $A = UBU^*$.

Proof: When $U$ is unitary, $U^* A^{p} U = (U^* A U)^{p}$ and $U B^{1-p} U^* = (UBU^*)^{1-p}$, so that

$$J_p(U, A, B) = J_p(I, U^* A U, B) = J_p(I, A, UBU^*). \tag{13}$$

Therefore, it suffices to consider the case $U = I$. For $p \in (0, 1)$ Hölder's inequality
implies $\mathrm{Tr}\, A^{p} B^{1-p} \le (\mathrm{Tr}\, A)^{p} (\mathrm{Tr}\, B)^{1-p} = 1$ with equality if and only if $A = B$. It
immediately follows that

$$J_p(I, A, B) \ge \frac{1}{p(1-p)}(\mathrm{Tr}\, A - 1) = 0 \quad\text{and}\quad J_p(I, A, B) = 0 \Leftrightarrow A = B. \tag{14}$$

For $p = 1$, the result is well-known [37, Section 2.5.2] and originally due to O. Klein
[16]. For $p \in (1, 2)$ we write $p = 1 + r$ and again use Hölder's inequality

$$1 = \mathrm{Tr}\, A = \mathrm{Tr}\, A B^{-\frac{r}{1+r}} B^{\frac{r}{1+r}} \le \big\| A B^{-\frac{r}{1+r}} \big\|_{1+r}\, (\mathrm{Tr}\, B)^{\frac{r}{1+r}} \le \big(\mathrm{Tr}\, A^{1+r} B^{-r}\big)^{\frac{1}{1+r}} \tag{15}$$

where we used $\mathrm{Tr}\, B = 1$ and the second inequality follows from a classic result of
Lieb-Thirring [24, Appendix B, Theorem 9] in the form given by Simon [3EJ Theorem
1.4.9]. QED
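When $A$ and $B$ commute, Proposition 1 with $U = I$ reduces to a statement about probability vectors, which can be checked directly. The sketch below assumes $A$ and $B$ are diagonal in a common basis with eigenvalue lists `a` and `b`; the name `j_diag` and the sample vectors are illustrative assumptions, not taken from the text.

```python
def j_diag(p, a, b):
    """J_p(I, A, B) for commuting A, B with eigenvalue lists a, b (p != 1).

    In a common eigenbasis, (7) with K = I becomes
    sum_i (a_i - a_i**p * b_i**(1 - p)) / (p (1 - p)).
    """
    total = sum(ai - ai ** p * bi ** (1 - p) for ai, bi in zip(a, b))
    return total / (p * (1 - p))

a = [0.5, 0.3, 0.2]   # eigenvalues of A, Tr A = 1
b = [0.2, 0.2, 0.6]   # eigenvalues of B, Tr B = 1

exponents = (0.25, 0.5, 0.75, 1.5, 2.0)
values = {p: j_diag(p, a, b) for p in exponents}        # all should be > 0
self_values = {p: j_diag(p, a, a) for p in exponents}   # all should vanish
```

For $p < 1$ the sum $\sum_i a_i^p b_i^{1-p}$ is at most 1 and the prefactor is positive; for $p > 1$ the sum is at least 1 and the prefactor is negative. Either way `values[p]` is strictly positive for $a \ne b$, and `self_values[p]` is zero up to rounding.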

Because the denominator $p(1 - p)$ changes sign at $p = 0$ and $p = 1$, both $g_p$ and $\tilde g_p$
are convex. In fact, they satisfy the much stronger condition of operator convexity
for $p \in (0, 2]$ and $p \in [-1, 1)$ respectively. Since $g_p(0) = 0$ and

$$\frac{g_p(x)}{x} = \begin{cases} \dfrac{1}{p(1-p)}\,(1 - x^{p-1}) & p \ne 1 \\ \log x & p = 1 \end{cases} \tag{16}$$
it follows that $g_p(x)/x$ is operator monotone [31 QUI [25], for $p \in (0, 2]$, i.e., it can be
analytically continued to the upper half plane, which it maps into itself. By applying
Nevanlinna's theorem [fl Section 59, Theorem 2] to $g_p(x)/x$, one finds that $g_p(x)$ has
an integral representation of the form

$$g_p(x) = ax + \int_0^\infty \frac{x^2 t - x}{x + t}\, \nu(t)\, dt = ax + \int_0^\infty \Big[ \frac{x^2}{x + t} - \frac{1}{t} + \frac{1}{x + t} \Big]\, t\,\nu(t)\, dt \tag{17}$$

with $\nu(t) \ge 0$. Integral representations are not unique, and making a suitable change
of variable in the classic formula

$$\int_0^\infty \frac{x^{p-1}}{x + 1}\, dx = \frac{\pi}{\sin p\pi} \equiv \frac{1}{c_p}, \qquad p \in (0, 1) \tag{18}$$



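allows us to give the following explicit representations

$$g_p(x) = \begin{cases} \dfrac{1}{p(1-p)} \Big[ x + c_p \displaystyle\int_0^\infty \Big( \frac{t}{x + t} - 1 \Big) t^{p-1}\, dt \Big] & p \in (0, 1) \\[2ex] \displaystyle\int_0^\infty \Big( \frac{x^2}{x + t} - 1 + \frac{t}{x + t} \Big) \frac{1}{1 + t}\, dt & p = 1 \\[2ex] \dfrac{1}{p(1-p)} \Big[ x - c_{p-1} \displaystyle\int_0^\infty \frac{x^2}{x + t}\, t^{p-2}\, dt \Big] & p \in (1, 2) \end{cases} \tag{19}$$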



Note that for $p \in (0, 2)$ the integrand is supported on $(0, \infty)$. This plays a key role
in the equality conditions; therefore, we will henceforth concentrate on $p \in (0, 2)$.
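The classic formula (18) can be sanity-checked numerically. The sketch below is an illustration under stated assumptions: it substitutes $x = e^u$, so the integrand becomes $e^{pu}/(1 + e^u)$ on the whole real line, and applies a plain trapezoid rule on a truncated interval. The function name `lhs_integral`, the truncation width and the step count are arbitrary choices.

```python
import math

def lhs_integral(p, half_width=200.0, steps=40000):
    """Trapezoid rule for the integral of x**(p-1)/(x+1) over (0, inf).

    After the substitution x = exp(u) the integrand is exp(p*u)/(1 + exp(u)),
    which decays exponentially in both directions when 0 < p < 1.
    """
    h = 2 * half_width / steps
    total = 0.0
    for i in range(steps + 1):
        u = -half_width + i * h
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * math.exp(p * u) / (1.0 + math.exp(u))
    return total * h

# Compare with pi / sin(p*pi) for a few exponents in (0, 1).
errors = {p: abs(lhs_integral(p) - math.pi / math.sin(p * math.pi))
          for p in (0.25, 0.5, 0.75)}
```

Rescaling $x \mapsto x/t$ in the same integral gives $x^p = c_p \int_0^\infty \frac{x}{x+t}\, t^{p-1}\, dt$ with $c_p = \sin(p\pi)/\pi$, which is the change of variable behind the explicit representations (19).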

Theorem 2 The function $J_p(K, A, B)$ defined in (6) is jointly convex in $A, B$.

Proof: It follows from (17) that

$$J_p(K, A, B) = a\, \mathrm{Tr}\, K^* A K + \int_0^\infty \Big[ \mathrm{Tr}\, K^* A \frac{1}{L_A + tR_B}(AK) - \frac{\mathrm{Tr}\, K B K^*}{t} + \mathrm{Tr}\, B K^* \frac{1}{L_A + tR_B}(KB) \Big]\, t\,\nu(t)\, dt \tag{20}$$

The joint convexity then follows immediately from that of the map
$(X, A, B) \mapsto \mathrm{Tr}\, X^* \frac{1}{L_A + tR_B}(X)$, which was proved in [36] following the strategy in [23].
The proof is also given in the Appendix. QED

For other approaches see Petz [29, 30], Effros [11]. The advantage of the argument
used here is that it immediately implies that equality holds in joint convexity if and
only if it holds for each term in the integrand.
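In the $1 \times 1$ case with $K = 1$, joint convexity of $J_p$ is the subadditivity of a two-variable scalar function, and equality occurs when the pairs $(a_j, b_j)$ are proportional. The following sketch checks this on random samples; it is an illustration only, and the name `j_scalar`, the seed, the sampling range and the list of exponents are assumptions.

```python
import random

def j_scalar(p, a, b):
    """Scalar case of J_p(1, a, b) = (a - a**p * b**(1 - p)) / (p (1 - p)), p != 1."""
    return (a - a ** p * b ** (1 - p)) / (p * (1 - p))

random.seed(0)
worst_gap = float("inf")
for _ in range(2000):
    p = random.choice([0.2, 0.5, 0.8, 1.3, 1.7, 2.0])
    a1, a2, b1, b2 = (random.uniform(0.1, 5.0) for _ in range(4))
    # Subadditivity: J(a1 + a2, b1 + b2) <= J(a1, b1) + J(a2, b2).
    gap = j_scalar(p, a1, b1) + j_scalar(p, a2, b2) - j_scalar(p, a1 + a2, b1 + b2)
    worst_gap = min(worst_gap, gap)

# Proportional pairs (a2, b2) = 3 * (a1, b1) give equality for every p.
equality_gaps = [
    j_scalar(p, 1.0, 2.0) + j_scalar(p, 3.0, 6.0) - j_scalar(p, 4.0, 8.0)
    for p in (0.2, 0.5, 0.8, 1.3, 1.7, 2.0)
]
```

The smallest sampled gap stays nonnegative up to rounding, and the proportional case vanishes for every sampled $p$, a scalar shadow of the $p$-independence of the equality conditions discussed in Section 3.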



Corollary 3 The relative entropy $H(A, B) = J_1(I, A, B)$ is jointly convex in $A, B$.

2.2 Extensions with $r \ne 1 - p$



We now consider extensions of Theorem 2 to situations in which $B^{1-p}$ is replaced
by $B^{r}$ with $r \ne 1 - p$, using an idea from Bekjan [6] and Effros [11]. We will also
show that equality holds in these extensions only under trivial conditions. For this
we first need an elementary lemma.

Lemma 4 Let $f : [0, \infty) \to \mathbb{R}$ be a non-linear convex or concave operator function,
let $A_1, A_2$ be density matrices and $A = \lambda A_1 + (1 - \lambda) A_2$ with $\lambda \in (0, 1)$. Then
$f(A) = \lambda f(A_1) + (1 - \lambda) f(A_2)$ if and only if $A_1 = A_2$.

Proof: Since any operator concave function is analytic, non-linearity implies that $f$
is strictly concave. If $f(A) = \lambda f(A_1) + (1 - \lambda) f(A_2)$, then

$$\langle v, f(A) v \rangle = \lambda \langle v, f(A_1) v \rangle + (1 - \lambda) \langle v, f(A_2) v \rangle \tag{21}$$

for any vector $v$. Now choose $v$ to be a normalized eigenvector of $A$. Then inserting
this on the left above and applying Jensen's inequality to each term on the right,
one finds

$$f(\langle v, A v \rangle) \le \lambda f(\langle v, A_1 v \rangle) + (1 - \lambda) f(\langle v, A_2 v \rangle) \tag{22}$$

But this contradicts concavity unless equality holds, which implies that $v$ is also
an eigenvector of $A_1$ and $A_2$. But then the strict concavity of $f$ also implies that
$\langle v, A_1 v \rangle = \langle v, A_2 v \rangle$. Since this holds for an orthonormal basis of eigenvectors of $A$,
we must have $A_1 = A_2$.

Corollary 5 The function $(A, B) \mapsto \mathrm{Tr}\, K^* A^{p} K B^{r}$ is jointly concave on the set of
positive definite matrices when $p, r > 0$ and $p + r \le 1$. Moreover, when $p + r < 1$
and $K$ is invertible, the concavity is strict unless $B_1 = B_2$ and $A_1 = A_2$.

Proof: It is an immediate consequence of Theorem 2 that $(A, B) \mapsto \mathrm{Tr}\, K^* A^{p} K B^{1-p}$
is jointly concave in $A, B$. Now write $\mathrm{Tr}\, K^* A^{p} K B^{r} = \mathrm{Tr}\, K^* A^{p} K (B^{s})^{1-p}$ with
$s = r/(1 - p)$. First, observe that for $0 < s < 1$ the function $f(x) = x^{s}$ satisfies the
hypotheses of Lemma 4. Therefore,

$$(\lambda B_1 + (1 - \lambda) B_2)^{s} > \lambda B_1^{s} + (1 - \lambda) B_2^{s} \tag{23}$$

with $0 < \lambda < 1$ and $B_1 \ne B_2$. The operator monotonicity of $x \mapsto x^{1-p}$ for $0 < p < 1$
then implies

$$(\lambda B_1 + (1 - \lambda) B_2)^{r} > (\lambda B_1^{s} + (1 - \lambda) B_2^{s})^{1-p}, \tag{24}$$

and the joint concavity of $\mathrm{Tr}\, K^* A^{p} K B^{1-p}$ implies

$$\mathrm{Tr}\, K^* A^{p} K (B^{s})^{1-p} \ge \mathrm{Tr}\, K^* (\lambda A_1 + (1 - \lambda) A_2)^{p} K (\lambda B_1^{s} + (1 - \lambda) B_2^{s})^{1-p} \ge \lambda\, \mathrm{Tr}\, K^* A_1^{p} K B_1^{s(1-p)} + (1 - \lambda)\, \mathrm{Tr}\, K^* A_2^{p} K B_2^{s(1-p)} \tag{25}$$

where $A = \lambda A_1 + (1 - \lambda) A_2$, $B = \lambda B_1 + (1 - \lambda) B_2$, which is precisely the joint concavity
of $\mathrm{Tr}\, K^* A^{p} K B^{r}$. Moreover, equality in joint concavity implies equality in (25) and,
since $K^* A^{p} K$ is strictly positive, this implies equality in (24). Therefore, the strict
inequality (23) gives a contradiction unless $B_1 = B_2$. In that case, the joint concavity reduces
to concavity in $A$ for which, by a similar argument, equality holds if and only if
$A_1 = A_2$. QED

Corollary 6 The function $(A, B) \mapsto \mathrm{Tr}\, K^* A^{p} K B^{1-r}$ is jointly convex on the set of
positive definite matrices when $1 < r \le p \le 2$. Moreover, when $r < p$ and $K$ is
invertible, the convexity is strict unless $B_1 = B_2$ and $A_1 = A_2$.

Proof: The argument is similar to that for Corollary 5. Write $\mathrm{Tr}\, K^* A^{p} K B^{1-r} =
\mathrm{Tr}\, K^* A^{p} K (B^{s})^{1-p}$ with $s = \frac{r-1}{p-1}$. Since $s \in (0, 1)$ and $1 - p \in (-1, 0)$ when
$1 < r < p < 2$, it follows that $x^{s}$ is operator concave and $x^{1-p}$ is operator monotone
decreasing. QED

2.3 Monotonicity under partial traces 

Let $X$ and $Z$ denote the generalized Pauli operators whose action on the standard
basis is $X|e_k\rangle = |e_{k+1}\rangle$ (with subscript addition mod $d$) and $Z|e_k\rangle = e^{i2\pi k/d}|e_k\rangle$. It
is well known and easy to verify that $\frac{1}{d}\sum_k Z^{k} A Z^{-k}$ is the projection of a matrix
onto its diagonal part. If $D$ is a diagonal matrix, then $\sum_k X^{k} D X^{-k} = (\mathrm{Tr}\, D) I$.
Now let $\{W_n\}_{n = 1, 2, \ldots, d^2}$ denote some ordering of the generalized Pauli operators, e.g.,
$W_{j + (k-1)d} = X^{j} Z^{k}$ with $j, k = 1, 2 \ldots d$. Then $\frac{1}{d}\sum_n W_n A W_n^* = (\mathrm{Tr}\, A) I$ and

$$\frac{1}{d} \sum_n (W_n \otimes I_2)\, A_{12}\, (W_n \otimes I_2)^* = I_1 \otimes (\mathrm{Tr}_1 A_{12}) = I_1 \otimes A_2 \tag{26}$$

Using the fact that replacing $W_n$ by $U W_n U^*$ with $U$ unitary simply corresponds
to a change of basis which does not affect (26), and then multiplying both sides by
$U^* \otimes I_2$ on the left and $U \otimes I_2$ on the right, gives the equivalent expression

$$\frac{1}{d} \sum_n (W_n U^* \otimes I_2)\, A_{12}\, (W_n U^* \otimes I_2)^* = I_1 \otimes A_2 \tag{27}$$

Combining this with joint convexity yields a slight generalization of the well-known
monotonicity of relative entropy under partial traces (MPT), first proved by Lieb in
[19] for the case $K_{12} = I_1 \otimes K_2$.
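The averaging identity $\frac{1}{d}\sum_n W_n A W_n^* = (\mathrm{Tr}\, A) I$ behind (26) can be verified directly for small $d$. The sketch below does this for $d = 3$ with plain nested lists; the helper names and the Hermitian test matrix are illustrative assumptions. It indexes $j, k$ from $0$ to $d - 1$ rather than $1$ to $d$, which gives the same set of operators because $X^d = Z^d = I$.

```python
import cmath

d = 3
omega = cmath.exp(2j * cmath.pi / d)

def matmul(P, Q):
    """Product of two square matrices given as lists of rows."""
    n = len(P)
    return [[sum(P[i][k] * Q[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def dagger(P):
    """Conjugate transpose."""
    n = len(P)
    return [[P[j][i].conjugate() for j in range(n)] for i in range(n)]

def power(P, m):
    """m-th power of a d x d matrix, with power(P, 0) the identity."""
    R = [[1 if i == j else 0 for j in range(d)] for i in range(d)]
    for _ in range(m):
        R = matmul(R, P)
    return R

# X|e_k> = |e_{k+1}> (indices mod d): entry (k+1, k) equals 1.
X = [[1 if i == (j + 1) % d else 0 for j in range(d)] for i in range(d)]
# Z|e_k> = omega**k |e_k>.
Z = [[omega ** i if i == j else 0 for j in range(d)] for i in range(d)]

paulis = [matmul(power(X, j), power(Z, k)) for j in range(d) for k in range(d)]

A = [[2, 1j, 0], [-1j, 3, 1], [0, 1, 5]]   # Hermitian test matrix, Tr A = 10
trace_A = sum(A[i][i] for i in range(d))

# twirl = (1/d) * sum_n W_n A W_n^*
twirl = [[0j] * d for _ in range(d)]
for W in paulis:
    term = matmul(matmul(W, A), dagger(W))
    for i in range(d):
        for j in range(d):
            twirl[i][j] += term[i][j] / d

max_err = max(abs(twirl[i][j] - (trace_A if i == j else 0))
              for i in range(d) for j in range(d))
```

The sum over powers of $Z$ removes the off-diagonal entries and the sum over powers of $X$ then spreads the diagonal evenly, which is the two-step mechanism described in the paragraph above.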
Theorem 7 Let $J_p$ be as in (7), $A_{12}, B_{12}$ strictly positive in $M_{d_1} \otimes M_{d_2}$ and $K_{12} =
V_1 \otimes K_2$ with $V_1$ unitary in $M_{d_1}$. Then

$$J_p(K_2, A_2, B_2) \le J_p(K_{12}, A_{12}, B_{12}) \tag{28}$$

Proof: Writing $\mathcal{W}_n$ for $W_n \otimes I_2$ and using (27) gives

$$J_p(K_2, A_2, B_2) = \tfrac{1}{d_1} J_p(I_1 \otimes K_2, I_1 \otimes A_2, I_1 \otimes B_2) \le \cdots = J_p(V_1 \otimes K_2, A_{12}, B_{12})$$

where the final equality follows from the unitary invariance of the trace. QED

Because $\mathrm{Tr}_{12}\, (V_1 \otimes K_2) A_{12} (V_1 \otimes K_2)^* = \mathrm{Tr}_2\, K_2 A_2 K_2^*$, (28) is equivalent to

$$\mathrm{Tr}\, K_2^* A_2^{p} K_2 B_2^{1-p} - \mathrm{Tr}\, (V_1 \otimes K_2)^* A_{12}^{p} (V_1 \otimes K_2) B_{12}^{1-p} \ \begin{cases} \ge 0 & p \in (0, 1) \\ \le 0 & p \in (1, 2) \end{cases} \tag{29}$$

We can obtain a weak reversal of this for $p \in (0, 1)$. The argument in the Appendix
shows that for any $p$ and fixed $A, B > 0$ both $\mathrm{Tr}\, K^* A^{p} K B^{1-p}$ and $\mathrm{Tr}\, K^* A K$ are
convex in $K$. This was observed earlier by Lieb [19] and also follows from the
results in [23]. One can then apply the argument above in the special case $A_{12} =
I_1 \otimes A_2$, $B_{12} = I_1 \otimes B_2$ to conclude that

$$\mathrm{Tr}\, K_2^* A_2^{p} K_2 B_2^{1-p} \le \tfrac{1}{d_1}\, \mathrm{Tr}\, K_{12}^* (I_1 \otimes A_2)^{p} K_{12} (I_1 \otimes B_2)^{1-p} \tag{30}$$
$$\le \mathrm{Tr}\, K_{12}^* (I_1 \otimes A_2)^{p} K_{12} (I_1 \otimes B_2)^{1-p} \tag{31}$$

independent of whether $p < 1$ or $p > 1$. However, because the term $\mathrm{Tr}\, K^* A K$ is
convex rather than linear in $K$, (30) does not allow us to draw any conclusions about
the monotonicity of $J_p(K_{12}, I_1 \otimes A_2, I_1 \otimes B_2)$.

To prove Theorem 7 we showed that joint convexity implies monotonicity; the
reverse implication also holds. Let $A_1, \ldots, A_m, B_1, \ldots, B_m$ be positive definite
matrices in $M_d$, $A = \sum_j A_j$, $B = \sum_j B_j$, and put

$$A_{12} = \sum_j |e_j\rangle\langle e_j| \otimes A_j, \qquad B_{12} = \sum_j |e_j\rangle\langle e_j| \otimes B_j \tag{32}$$

for $e_1, \ldots, e_m$ the standard basis of $\mathbb{C}^m$. Then $A_{12}$ and $B_{12}$ are block diagonal, and
$A_2 = \mathrm{Tr}_1 A_{12} = \sum_k A_k = A$ and similarly for $B$. Then if monotonicity under partial
traces holds, one can conclude that

$$J_p(K, A, B) = J_p(K_2, A_2, B_2) \le J_p(I_1 \otimes K, A_{12}, B_{12}) = \sum_j J_p(K, A_j, B_j) \tag{33}$$

Thus, monotonicity under partial traces also directly implies joint convexity of $J_p$.
Applying (28) in the case $K = I$, and $A_{12} \mapsto A_{123}$ and $B_{12} \mapsto A_{12} \otimes I_3$ gives

$$J_p(I_{23}, A_{23}, A_2 \otimes I_3) \le J_p(I_{123}, A_{123}, A_{12} \otimes I_3) \tag{34}$$

When $p = 1$, it follows from (7) that

$$J_1(I_{23}, A_{23}, A_2 \otimes I_3) = H(A_{23}, A_2 \otimes I_3) = -S(A_{23}) + S(A_2)$$

where $S(A) = -\mathrm{Tr}\, A \log A$. Thus, (34) becomes

$$-S(A_{23}) + S(A_2) \le -S(A_{123}) + S(A_{12})$$

or, equivalently

$$S(A_2) + S(A_{123}) \le S(A_{12}) + S(A_{23}) \tag{35}$$

which is the standard form of SSA.
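In the commuting case the $A$'s are joint probability distributions and (35) is the classical statement that the conditional mutual information is nonnegative, with equality for a Markov chain $1 - 2 - 3$. The sketch below illustrates both situations on three bits; the two distributions and all helper names are hypothetical choices made for the example.

```python
import math

def entropy(probs):
    """Shannon entropy -sum q log q, the commuting case of S(A) = -Tr A log A."""
    return -sum(q * math.log(q) for q in probs if q > 0)

def marginal(joint, keep):
    """Marginalise a dict {(x1, x2, x3): prob} onto the coordinates in keep."""
    out = {}
    for key, q in joint.items():
        sub = tuple(key[i] for i in keep)
        out[sub] = out.get(sub, 0.0) + q
    return out

def ssa_gap(joint):
    """S(A_12) + S(A_23) - S(A_2) - S(A_123), which SSA says is nonnegative."""
    def s(keep):
        return entropy(marginal(joint, keep).values())
    return s((0, 1)) + s((1, 2)) - s((1,)) - s((0, 1, 2))

bits = (0, 1)

# A generic (hypothetical) joint distribution on three bits.
weights = {(x, y, z): 1 + x + 2 * y + 4 * z + x * z
           for x in bits for y in bits for z in bits}
total = sum(weights.values())
generic = {key: w / total for key, w in weights.items()}

# A Markov chain 1 - 2 - 3: p(x, y, z) = p(x, y) p(z | y).
p_xy = {(0, 0): 0.1, (0, 1): 0.2, (1, 0): 0.3, (1, 1): 0.4}
p_z_given_y = {0: (0.7, 0.3), 1: (0.25, 0.75)}
markov = {(x, y, z): p_xy[(x, y)] * p_z_given_y[y][z]
          for x in bits for y in bits for z in bits}

generic_gap = ssa_gap(generic)   # strictly positive
markov_gap = ssa_gap(markov)     # zero up to rounding
```

The Markov example factors through the middle system, which is the classical counterpart of the block structure that appears in the equality conditions of Section 3.4.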

3 Equality for joint convexity of $J_p(K, A, B)$

3.1 Origin of necessary and sufficient conditions

Looking back at the proof of Theorem 2 we see that for $p \in (0, 2)$, equality holds in
the joint convexity of $J_p(K, A, B)$ if and only if equality holds in the joint convexity
for each term in the integrand in (17). It should be clear from the argument given
in the Appendix that this requires $M_j = 0$ for all $j$ with $M_j$ given by (70). This is
easily seen to be equivalent to

$$(L_{A_j} + tR_{B_j})^{-1}(X_j) = (L_A + tR_B)^{-1}(X) \quad \text{for all } j, \tag{36}$$

with $X_j = A_j K$ and/or $X_j = K B_j$. By writing $AK = L_A(K)$ in the former case
and $KB = R_B(K)$ in the latter we obtain the conditions

$$(I + t\Delta_{A_j B_j}^{-1})^{-1}(K) = (I + t\Delta_{AB}^{-1})^{-1}(K) \quad \forall j \ \ \forall t > 0 \tag{37a}$$
$$(\Delta_{A_j B_j} + tI)^{-1}(K) = (\Delta_{AB} + tI)^{-1}(K) \quad \forall j \ \ \forall t > 0 \tag{37b}$$

From the integral representations (19), one might expect it to be necessary for either
or both of (37a) and (37b) to hold depending on $p$. In fact, either will suffice because
(37a) holds if and only if (37b) holds. Because $\Delta_{AB}$ is positive definite, by analytic
continuation (37b) extends from $t > 0$ to the entire complex plane, except points
$-t$ on the negative real axis for which $t \in \mathrm{spectrum}(\Delta_{AB})$. Therefore, by using the
Cauchy integral formula, one finds that for any function $G$ analytic on $\mathbb{C} \setminus (-\infty, 0]$

$$G(\Delta_{A_j B_j})(K) = G(\Delta_{AB})(K).$$

Theorem 8 For fixed $K$, and $A = \sum_j A_j$, $B = \sum_j B_j$, the following are equivalent

a) $J_p(K, A, B) = \sum_j J_p(K, A_j, B_j)$ for all $p \in (0, 2)$.

b) $J_p(K, A, B) = \sum_j J_p(K, A_j, B_j)$ for some $p \in (0, 2)$.

c) $(\Delta_{A_j B_j} + tI)^{-1}(K) = (\Delta_{AB} + tI)^{-1}(K)$ for all $j$ and for all $t > 0$.

d) $A_j^{it} K B_j^{-it} = A^{it} K B^{-it}$ for all $j$ and for all $t > 0$.

e) $(\log A - \log A_j) K = K (\log B - \log B_j)$ for all $j$.

Proof: Clearly (a) $\Rightarrow$ (b). The implications (b) $\Rightarrow$ (c) $\Rightarrow$ (d), as well as (b) $\Rightarrow$
(a), follow from the discussion above. Differentiation of (d) at $t = 0$ gives (d) $\Rightarrow$
(e), and it is straightforward to verify that (e) $\Rightarrow$ (b) with $p = 1$. Moreover, (d)
implies $\sum_j \mathrm{Tr}\, K^* A_j^{it} K B_j^{1-it} = \mathrm{Tr}\, K^* A^{it} K B^{1-it}$ for all $t$, which implies (a) by analytic
continuation. QED



3.2 Sufficient subalgebras 

When $K = I$, we can obtain a more useful reformulation of the equality conditions by
using results about sufficient subalgebras obtained in [TU, 15, 32]. Since the definition
and convexity properties of $J_p(I, A, B)$ extend by continuity to positive semidefinite
matrices with $\ker B \subseteq \ker A$, we will formulate the conditions in this more general
situation, using the conventions in Section 1.2.

Let $N \subseteq M_d$ be a subalgebra; then there is a trace preserving conditional expectation
$E_N$ from $M_d$ onto $N$, such that $\mathrm{Tr}\, AX = \mathrm{Tr}\, E_N(A) X$ for all $X \in N$. In
particular, if $N = M_{d_1} \otimes I \subseteq M_{d_1} \otimes M_{d_2}$, then we have $E_N(A_{12}) = \mathrm{Tr}_2 A_{12} \otimes \frac{1}{d_2} I$.

Let $Q_1, \ldots, Q_m \ge 0$ and assume that $\ker Q_m \subseteq \ker Q_j$ for all $j$. The subalgebra
$N$ is said to be sufficient for $\{Q_1, \ldots, Q_m\}$ if there is a completely positive trace
preserving map $T : N \to M_d$ such that $T(E_N(Q_j)) = Q_j$ for all $j = 1, \ldots, m$.
This definition is due to Petz [32, 31] and it is a quantum generalization of the
well known notion of sufficiency from classical statistics. In [32], it was shown that
sufficient subalgebras can be characterized by the condition

$$H(Q_j, Q_m) = H(E_N(Q_j), E_N(Q_m)) \quad \text{for all } j.$$
We combine this with the results of the previous section to obtain other useful
characterizations of sufficiency.

Theorem 9 Let $Q_1, \ldots, Q_m \in M_d^+$ be such that $\ker Q_m \subseteq \ker Q_j$ for all $j$. Let
$N \subseteq M_d$ be a subalgebra. The following are equivalent.

(i) $N$ is sufficient for $\{Q_1, \ldots, Q_m\}$.

(ii) $E_N(Q_j)^{it} E_N(Q_m)^{-it} P_{(\ker Q_m)^\perp} = Q_j^{it} Q_m^{-it}$, for all $j$, $t \in \mathbb{R}$.

(iii) There exist $Q_{j,0} \in N^+$ and $D \in M_d^+$, such that $\ker D = \ker Q_m$, and
$Q_j = Q_{j,0} D$ for $j = 1, \ldots, m$.

(iv) $J_p(I, Q_j, Q_m) = J_p(I, E_N(Q_j), E_N(Q_m))$ for all $j$ and some $p \in (0, 1)$.

The proof of the conditions (i) - (iii) can be found in [H], see also [27]. The condition
(iv) was proved in [T5].

3.3 Equality conditions with K = I 

Theorem 10 Let $A_1, \ldots, A_m$ and $B_1, \ldots, B_m$ be positive semi-definite matrices with
$\ker B_j \subseteq \ker A_j$, and let $A = \sum_j A_j$, $B = \sum_j B_j$. Then the following are equivalent.

a) $J_p(I, A, B) = \sum_j J_p(I, A_j, B_j)$ for all $p \in (0, 2)$.

b) $J_p(I, A, B) = \sum_j J_p(I, A_j, B_j)$ for some $p \in (0, 2)$.

c) $A_j^{it} B_j^{-it} = A^{it} B^{-it} P_{(\ker B_j)^\perp}$ for all $j$ and $t \in \mathbb{R}$.

d) There are positive matrices $D_1, \ldots, D_m$, with $\ker D_j = \ker B_j$, such that
$[A_j, D_j] = [B_j, D_j] = 0$, and with $D = \sum_j D_j$

$$A_j = A D^{-1} D_j, \qquad B_j = B D^{-1} D_j \tag{38}$$

Proof: As in Section 3.1, (b) implies (36) on $(\ker B_j)^\perp$, with $X_j = B_j$, $X = B$. This
gives

$$(\Delta_{A_j B_j} + tI)^{-1}(I) = (\Delta_{AB} + tI)^{-1}(I) \quad \text{on } (\ker B_j)^\perp. \tag{39}$$

Then (c) follows from the Cauchy integral formula as in Section 3.1.

To show (c) implies (d), we will use Theorem 9. First let $N = I \otimes M_d \subseteq M_m \otimes M_d$
and let $A_{12}, B_{12}$ be the block-diagonal matrices in $M_m \otimes M_d$ defined by (32). Clearly,
we have $\ker A_{12} \supseteq \ker B_{12} = \sum_j |e_j\rangle\langle e_j| \otimes \ker B_j$ and $E_N(A_{12}) = \frac{1}{m} I \otimes A$, $E_N(B_{12}) =
\frac{1}{m} I \otimes B$. Then (c) implies $E_N(A_{12})^{it} E_N(B_{12})^{-it} P_{(\ker B_{12})^\perp} = A_{12}^{it} B_{12}^{-it}$ for all $t$. Then
by using Theorem 9 with $Q_1 = A_{12}$, $Q_m = Q_2 = B_{12}$, we can conclude that there are
positive matrices $A_0, B_0 \in M_d$ and $D_{12} \in (M_m \otimes M_d)^+$, such that $\ker D_{12} = \ker B_{12}$,
$[I \otimes A_0, D_{12}] = [I \otimes B_0, D_{12}] = 0$ and

$$A_{12} = (I \otimes A_0) D_{12}, \qquad B_{12} = (I \otimes B_0) D_{12} \tag{40}$$

Since $A_{12}, B_{12}$ are block diagonal, $D_{12} = \sum_j |e_j\rangle\langle e_j| \otimes D_j$ must also be block diagonal
with $D_j \in M_d^+$, $\ker D_j = \ker B_j$, $[A_0, D_j] = [B_0, D_j] = 0$ for all $j$ and

$$A_j = A_0 D_j, \qquad B_j = B_0 D_j. \tag{41}$$

Taking $\mathrm{Tr}_1$ in (40) gives $A = A_0 D$ and $B = B_0 D$. Using this in (41) gives (38), which
proves (d). The implications (d) $\Rightarrow$ (a) $\Rightarrow$ (b) are straightforward. QED

We return briefly to the case of arbitrary $K$. Note that if the condition (d) holds
and $[D_j, K] = 0$ for all $j$, then $J_p(K, A, B) = \sum_j J_p(K, A_j, B_j)$ for all $p \in (0, 2)$;
this gives a sufficient, but not necessary, condition for equality if $K \ne I$. The next
result reduces the case of $K$ unitary to $K = I$. Then, we can apply the conditions
of Theorem 10 to $A_j$ and $K B_j K^*$.

Theorem 11 If $K$ is unitary, then $J_p(K, A, B) = \sum_j J_p(K, A_j, B_j)$ if and only if
$J_p(I, A, KBK^*) = \sum_j J_p(I, A_j, K B_j K^*)$.

Proof: When $K$ is unitary, then $K B^{p} K^* = (KBK^*)^{p}$, which implies $J_p(K, A, B) =
J_p(I, A, KBK^*)$. QED

One can try to extend the results of this section to the case $\|K\| \le 1$, and hence
to all $K$, by using the unitary dilation $\mathcal{U}$ of $K$

where $L = U(I - |K|^2)^{1/2}$ and $K = U|K|$ is the polar decomposition. Then, with

$$\mathcal{A} = \begin{pmatrix} A & 0 \\ 0 & 0 \end{pmatrix}, \qquad \mathcal{B} = \begin{pmatrix} B & 0 \\ 0 & 0 \end{pmatrix}$$

we have $J_p(K, A, B) = J_p(\mathcal{U}, \mathcal{A}, \mathcal{B})$, so that we may use Theorem 11 to get conditions
for equality. But note that the conditions of Theorem 10 require that $\ker \mathcal{U}\mathcal{B}_j\mathcal{U}^* \subseteq
\ker \mathcal{A}_j$ and it can be shown that this implies $P_{(\ker A_j)^\perp} K P_{(\ker B_j)^\perp} K^* = P_{(\ker A_j)^\perp}$,
where $P_{\mathcal{M}}$ denotes a projection onto the subscripted space. In particular, if all $A_j$
and $B_j$ are invertible, this restricts us to unitary $K$.
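3.4 Equality in monotonicity under partial trace

It is easy to see that when $A_{12} = A_1 \otimes A_2$ and $B_{12} = B_1 \otimes B_2$, then $J_p(I, A_{12}, B_{12}) =
J_p(I, A_2, B_2)$ if and only if $A_1 = B_1$ with $\mathrm{Tr}\, A_1 = 1$. However, it is not necessary
that $A_{12} = A_1 \otimes A_2$. The equality conditions are given by the following theorem.

Theorem 12 Let $K = I$ and $A_{12}, B_{12} \in B(\mathcal{H}_1 \otimes \mathcal{H}_2)^+$, with $\ker B_{12} \subseteq \ker A_{12}$.
Equality holds in (28) if and only if

(i) $\mathcal{H}_2 = \bigoplus_n \mathcal{H}_n^L \otimes \mathcal{H}_n^R$.

(ii) $A_{12} = \bigoplus_n A_n^L \otimes A_n^R$ with $A_n^L \in B(\mathcal{H}_1 \otimes \mathcal{H}_n^L)^+$ and $A_n^R \in B(\mathcal{H}_n^R)^+$.

(iii) $B_{12} = \bigoplus_n B_n^L \otimes B_n^R$ with $B_n^L \in B(\mathcal{H}_1 \otimes \mathcal{H}_n^L)^+$ and $B_n^R \in B(\mathcal{H}_n^R)^+$.

(iv) $A_n^L = B_n^L$ for all $n$.

Proof: Let us denote $A_j = \frac{1}{d_1} \mathcal{W}_j A_{12} \mathcal{W}_j^*$, $B_j = \frac{1}{d_1} \mathcal{W}_j B_{12} \mathcal{W}_j^*$, with $\mathcal{W}_j$ defined as in
the proof of Theorem 7. Then we get that equality in (28) is equivalent to

$$J_p\Big(I_{12}, \sum_j A_j, \sum_j B_j\Big) = \sum_j J_p(I_{12}, A_j, B_j)$$

By Theorem 10, equality for some $p$ implies equality for all $p$, so that $J_p(I, A_{12}, B_{12}) =
J_p(I, \mathrm{Tr}_1 A_{12}, \mathrm{Tr}_1 B_{12}) = J_p(I, E_N(A_{12}), E_N(B_{12}))$ for $p \in (0, 1)$, where $N$ is the subalgebra
$I_1 \otimes B(\mathcal{H}_2) \subseteq B(\mathcal{H}_1 \otimes \mathcal{H}_2)$. Hence $N$ is sufficient for $\{A_{12}, B_{12}\}$ and, by Theorem
9, there are some $A_R, B_R \in B(\mathcal{H}_2)^+$ and $D \in B(\mathcal{H}_1 \otimes \mathcal{H}_2)^+$, $\ker D = \ker B_{12}$, such
that $[(I_1 \otimes A_R), D] = [(I_1 \otimes B_R), D] = 0$ and

$$A_{12} = D (I_1 \otimes A_R), \qquad B_{12} = D (I_1 \otimes B_R) \tag{42}$$

Now let $\mathcal{M}_1$ be the subalgebra in $B(\mathcal{H}_2)$ generated by $A_R, B_R$. Then $D \in (I_1 \otimes \mathcal{M}_1)' =
B(\mathcal{H}_1) \otimes \mathcal{M}_1'$, where $\mathcal{M}'$ denotes the commutant of $\mathcal{M}$. There is a decomposition
$\mathcal{H}_2 = \bigoplus_n \mathcal{H}_n^L \otimes \mathcal{H}_n^R$ such that

$$\mathcal{M}_1' = \bigoplus_n B(\mathcal{H}_n^L) \otimes I_n^R, \qquad \mathcal{M}_1 = \bigoplus_n I_n^L \otimes B(\mathcal{H}_n^R)$$

and $D = \bigoplus_n D_n \otimes I_n^R$ where $D_n \in B(\mathcal{H}_1 \otimes \mathcal{H}_n^L)^+$. Since $A_R, B_R \in \mathcal{M}_1$, we get the
result, with $A_n^L = B_n^L = D_n$. The converse can be verified directly. QED

Applying this result in the case $A_{12} \mapsto A_{123}$ and $B_{12} \mapsto A_{12} \otimes I_3$ gives equality
conditions in (34). Since these are independent of $p$, they are identical to the
conditions, first given in [15], for equality in SSA (35), which corresponds to $p = 1$.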

Corollary 13 Equality holds in (34) if and only if

(i) $\mathcal{H}_2 = \bigoplus_n \mathcal{H}_n^L \otimes \mathcal{H}_n^R$.

(ii) $A_{123} = \bigoplus_n A_n^L \otimes A_n^R$ with $A_n^L \in B(\mathcal{H}_1 \otimes \mathcal{H}_n^L)$ and $A_n^R \in B(\mathcal{H}_n^R \otimes \mathcal{H}_3)$.

Proof: It suffices to let $A_{12} \to A_{123}$ and $B_{12} \to A_{12} \otimes I_3$ in Theorem 12. QED

To apply these results in Section 4, it is useful to observe that condition (ii) in
Corollary 13 above can be written as

$$A_{123} = (F_L \otimes I_3)(I_1 \otimes F_R) \tag{43}$$

with $F_L \in B(\mathcal{H}_1 \otimes \mathcal{H}_2)^+$, $F_R \in B(\mathcal{H}_2 \otimes \mathcal{H}_3)^+$, $[F_L \otimes I_3, I_1 \otimes F_R] = 0$. Combining
this with part (d) of Theorem 10 gives the following useful result, which essentially
allows us to bypass the need to apply Theorem 10 to $J_p(I, A_j, \mathcal{W}_n A_j \mathcal{W}_n^*)$.

Corollary 14 Let $A_j \in M_{d_1} \otimes M_{d_2}$, $A = \sum_j A_j$. Then

$$J_p(I_{12}, A, (\mathrm{Tr}_2 A) \otimes I_2) = \sum_j J_p(I_{12}, A_j, (\mathrm{Tr}_2 A_j) \otimes I_2) \tag{44}$$

if and only if there are $D_j \in M_{d_1}^+$, such that $\ker D_j = \ker \mathrm{Tr}_2 A_j$, $[A_j, D_j \otimes I] = 0$ and
$A_j = A (D^{-1} D_j \otimes I)$ with $D = \sum_j D_j$.

Proof: Let $A_{123} = \sum_j |e_j\rangle\langle e_j| \otimes A_j \in M_m \otimes M_{d_1} \otimes M_{d_2}$; then $A = A_{23} \in M_{d_1} \otimes M_{d_2}$
and (44) can be written as

$$J_p(I_{23}, A_{23}, A_2 \otimes I_3) = J_p(I_{123}, A_{123}, A_{12} \otimes I_3)$$

By (43), this is equivalent to the existence of $F_L$ and $F_R$, $[(F_L \otimes I_3), (I_1 \otimes F_R)] = 0$,
such that $A_{123} = (F_L \otimes I_3)(I_1 \otimes F_R)$. Since $A_{(1)(23)}$ is block-diagonal, $F_L$ must be of
the form $F_L = \sum_j |e_j\rangle\langle e_j| \otimes D_j$ so that $A_j = F_R (D_j \otimes I)$. But taking $\mathrm{Tr}_1$ of (43)
gives $A = (D \otimes I_3) F_R = F_R (D \otimes I_3)$ so that $A_j = A (D^{-1} D_j \otimes I)$. QED

4 Equality in joint convexity of Carlen-Lieb 

Carlen and Lieb [8] obtain several convexity inequalities from those of the map 

T m (K, A) = Tr {K*A p K) q/p . (45) 

using an identity which we write only for q = 1 and p > 1 in our notation as 

T Ptl (K, A) = (p- 1) inf { J P {K, A, X) + ±Tr X + ^__Tr K*AK : X > 0} (46) 

We introduce the closely related quantity 

f Ptl (K,A) = M{j p (K,A,X) + 1 -TrX:X>0} (47) 
= (^PpAK,A)-±TrK*AK) (48) 
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which is well-defined for all p G (0, 2) and allows us to continue to treat the cases 
p < 1 and p > 1 simultaneously, as well as include the special case p = 1 for which 

T hl {K,A) = -TriTA^logfTTAFf) + Tr ir*(Alog A)^ 

= S^AK") + TrinrMlog,4 (49) 

Since we are dealing with finite dimensional spaces, the infimum in (j46p has a 
minimizer which satisfies 

X min = (K*A p K) 1 / p . (50) 
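For fixed $K$, let $X_j$ denote the minimizer associated with $A_j$. Then

$$ \widehat{T}_{p,1}(K, A_1) + \widehat{T}_{p,1}(K, A_2) = J_p(K, A_1, X_1) + \tfrac{1}{p} \mathrm{Tr}\, X_1 + J_p(K, A_2, X_2) + \tfrac{1}{p} \mathrm{Tr}\, X_2 $$
$$ \geq J_p(K, A_1 + A_2, X_1 + X_2) + \tfrac{1}{p} \mathrm{Tr}\, (X_1 + X_2) \tag{51} $$
$$ \geq \inf \Big\{ J_p(K, A_1 + A_2, X) + \tfrac{1}{p} \mathrm{Tr}\, X : X > 0 \Big\} $$
$$ = \widehat{T}_{p,1}(K, A_1 + A_2) \tag{52} $$

which proves convexity of $\widehat{T}_{p,1}$. Note that equality above requires both $X = \sum_j X_j$ and $J_p(K, A, X) = \sum_j J_p(K, A_j, X_j)$, where $X$ is the minimizer associated with $A$.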
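As a sanity check on the variational formula, the following minimal sketch works in the scalar (1x1) case only, where $K$, $A$ and $X$ are positive numbers $k$, $a$, $x$. The function names and the sample values are illustrative assumptions, not part of the paper. It confirms numerically that $x_{\min} = (k^2 a^p)^{1/p}$ minimizes $J_p + x/p$ and that the minimum agrees with the closed form in (48).

```python
def objective(k, a, x, p):
    """Scalar J_p(k, a, x) + x/p, with J_p = (k^2 a - k^2 a^p x^(1-p)) / (p (1-p))."""
    return (k * k * a - k * k * a**p * x ** (1 - p)) / (p * (1 - p)) + x / p


def x_min(k, a, p):
    """Scalar analogue of the minimizer X_min = (K* A^p K)^(1/p)."""
    return (k * k * a**p) ** (1 / p)


def closed_form(k, a, p):
    """Scalar analogue of (48): (x_min - k^2 a / p) / (p - 1)."""
    return (x_min(k, a, p) - k * k * a / p) / (p - 1)


def check(k=1.3, a=0.7):
    """Compare the minimum over a grid with the closed form, for one p < 1 and one p > 1."""
    results = []
    grid = [0.05 * i for i in range(1, 200)]
    for p in (0.4, 1.6):
        best = objective(k, a, x_min(k, a, p), p)
        is_min = all(objective(k, a, x, p) >= best - 1e-12 for x in grid)
        results.append((p, best, closed_form(k, a, p), is_min))
    return results


for p, best, formula, is_min in check():
    print(p, best, formula, is_min)
```

The same objective is convex in $x$ for every $p > 0$, which is why a single stationary point suffices in this scalar setting.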

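Now we introduce some notation following the strategy in the published version of [8]. Let $|\mathbf{1}\rangle$ denote the vector $(1, 1, \ldots, 1)$ with all components 1 and $|e_1\rangle$ the vector $(1, 0, \ldots, 0)$. Define

$$ \mathcal{K} = I \otimes |\mathbf{1}\rangle\langle e_1| = \begin{pmatrix} I & 0 & \cdots & 0 \\ I & 0 & \cdots & 0 \\ \vdots & \vdots & & \vdots \\ I & 0 & \cdots & 0 \end{pmatrix} \tag{53} $$

and

$$ \mathcal{A}_j = \sum_k A_{jk} \otimes |e_k\rangle\langle e_k| = \begin{pmatrix} A_{j1} & 0 & 0 & \cdots \\ 0 & A_{j2} & 0 & \cdots \\ 0 & 0 & A_{j3} & \cdots \\ \vdots & \vdots & \vdots & \ddots \end{pmatrix} \tag{54} $$

and $\mathcal{A} = \sum_j \mathcal{A}_j = \sum_k A_k \otimes |e_k\rangle\langle e_k|$ with $A_k = \sum_j A_{jk}$. Then

$$ \mathcal{K}^* \mathcal{A}^p \mathcal{K} = \Big( \sum_k A_k^p \Big) \otimes |e_1\rangle\langle e_1| $$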
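With this notation, we make some definitions following Carlen and Lieb but modified to allow a unified treatment of $p \in (0, 2)$.

$$ \Phi_{(p,1)}(\mathcal{A}) = \Phi_{(p,1)}(A_1, A_2, A_3 \ldots) = \mathrm{Tr}\, (A_1^p + A_2^p + A_3^p + \ldots)^{1/p} \tag{55} $$
$$ \widehat{\Phi}_{(p,1)}(\mathcal{A}) = \widehat{\Phi}_{(p,1)}(A_1, A_2, A_3 \ldots) \equiv \widehat{T}_{p,1}(\mathcal{K}, \mathcal{A}) \tag{56} $$
$$ = \frac{1}{p-1} \Big[ \Phi_{(p,1)}(\mathcal{A}) - \frac{1}{p} \sum_k \mathrm{Tr}\, A_k \Big] $$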

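The definitions of $\Phi$ and $\widehat{\Phi}$ apply only when $\mathcal{A}$ is a block diagonal matrix in $M_d \otimes M_m$. We now extend this to arbitrary matrices $A_{12} \in M_{d_1} \otimes M_{d_2}$.

$$ \Psi_{(p,1)}(A_{12}) \equiv \mathrm{Tr}_1 (\mathrm{Tr}_2 A_{12}^p)^{1/p} \tag{57} $$
$$ \widehat{\Psi}_{(p,1)}(A_{12}) \equiv \frac{1}{p-1} \Big[ \Psi_{(p,1)}(A_{12}) - \frac{1}{p} \mathrm{Tr}\, A_{12} \Big] \tag{58} $$

For $p = 1$, the formulas with hats reduce to the conditional entropy

$$ \widehat{\Phi}_{(1,1)}(A_1, A_2, A_3 \ldots) = -\mathrm{Tr}\, \Big( \sum_k A_k \Big) \log \Big( \sum_k A_k \Big) + \sum_k \mathrm{Tr}\, A_k \log A_k $$
$$ = S\Big( \sum_k A_k \Big) - S(\mathcal{A}_{12}) = J_1(I, \mathcal{A}_{12}, \mathrm{Tr}_2 \mathcal{A}_{12} \otimes I_2) \tag{59} $$
$$ \widehat{\Psi}_{(1,1)}(A_{12}) = S(A_1) - S(A_{12}) = J_1(I, A_{12}, A_1 \otimes I_2) \tag{60} $$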

When $\mathcal{A}_{12}$ is block diagonal, $\Psi_{(p,1)}(\mathcal{A}) = \Phi_{(p,1)}(\mathcal{A})$ with the understanding that $\mathrm{Tr}_2 \mathcal{A} = \sum_k A_k$. Now let $W_n$ denote the generalized Pauli matrices as in Section 2.3, $\mathcal{W}_n = I_1 \otimes W_n$ and define

$$ A_{123} = \sum_n \mathcal{W}_n A_{12} \mathcal{W}_n^* \otimes |e_n\rangle\langle e_n| \tag{61} $$

so that $A_{123}$ is block diagonal with blocks $\mathcal{W}_n A_{12} \mathcal{W}_n^*$. Then

$$ d_2^{\frac{1+p}{p}} \Psi_{(p,1)}(A_{12}) = \Phi_{(p,1)}(A_{(12)(3)}) = \Phi_{(p,1)}(\mathcal{W}_1 A_{12} \mathcal{W}_1^*, \mathcal{W}_2 A_{12} \mathcal{W}_2^*, \ldots). \tag{62} $$

It is straightforward to show that for $p \in (0, 2)$ the functions $\widehat{\Phi}_{(p,1)}(\mathcal{A})$ and $\widehat{\Psi}_{(p,1)}(A)$ are all convex in $A$, inheriting this property from the quantities from which they are defined. In view of (59) and (60), the conditions for equality in the next two theorems are not surprising.

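Theorem 15 The function $\widehat{\Phi}_{(p,1)}(\mathcal{A})$ is convex in $\mathcal{A}$ for $p \in (0, 2)$. Moreover, the following are equivalent

(i) $J_p(I, \mathcal{A}, (\mathrm{Tr}_2 \mathcal{A}) \otimes I_2) = \sum_j J_p(I, \mathcal{A}_j, (\mathrm{Tr}_2 \mathcal{A}_j) \otimes I_2)$

(ii) There are matrices $D_j \geq 0$, $D = \sum_j D_j$, such that $[A_{jk}, D_j] = 0$, $\ker D_j = \ker (\sum_k A_{jk})$ and $A_{jk} = A_k D^{-1} D_j$.

(iii) $\widehat{\Phi}_{(p,1)}(A_1, A_2, A_3 \ldots) = \sum_j \widehat{\Phi}_{(p,1)}(A_{j1}, A_{j2}, A_{j3} \ldots)$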
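Proof: It follows from Corollary 14 and the fact that $\mathcal{A}_j$ are block-diagonal that (i) $\Leftrightarrow$ (ii) and it is straightforward to verify that (ii) $\Rightarrow$ (iii). Moreover, (iii) implies (i) for $p = 1$, by (59). To show that (iii) implies (ii) for $p \neq 1$, observe that (iii) implies $\widehat{T}_{p,1}(\mathcal{K}, \mathcal{A}) = \sum_j \widehat{T}_{p,1}(\mathcal{K}, \mathcal{A}_j)$, and this implies

$$ J_p(\mathcal{K}, \mathcal{A}, \mathcal{X}) = \sum_j J_p(\mathcal{K}, \mathcal{A}_j, \mathcal{X}_j) \tag{63} $$

where $\mathcal{X}_j = (\mathcal{K}^* \mathcal{A}_j^p \mathcal{K})^{1/p} = X_j \otimes |e_1\rangle\langle e_1|$ and $\sum_j \mathcal{X}_j = \mathcal{X} = (\mathcal{K}^* \mathcal{A}^p \mathcal{K})^{1/p} = X \otimes |e_1\rangle\langle e_1|$, with $X_j = (\sum_k A_{jk}^p)^{1/p}$ and $X = (\sum_k A_k^p)^{1/p}$. Since

$$ \mathcal{K}^* \mathcal{A}_j^p \mathcal{K} \mathcal{X}_j^{1-p} = \Big( \sum_k A_{jk}^p X_j^{1-p} \Big) \otimes |e_1\rangle\langle e_1|, $$

with a similar expression for $\mathcal{K}^* \mathcal{A}^p \mathcal{K} \mathcal{X}^{1-p}$, we find

$$ \sum_k J_p(I, A_k, X) = J_p(\mathcal{K}, \mathcal{A}, \mathcal{X}) = \sum_j J_p(\mathcal{K}, \mathcal{A}_j, \mathcal{X}_j) = \sum_j \sum_k J_p(I, A_{jk}, X_j) $$

Convexity then implies that we must have

$$ J_p(I, A_k, X) = \sum_j J_p(I, A_{jk}, X_j) \tag{64} $$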

Since kerXj C ker A^, Theorem [TOl implies that 

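$$ A_k^{it} X^{-it} P_{(\ker X)^\perp} = A_{jk}^{it} X_j^{-it}, \quad \text{for all } k, j, t \tag{65} $$

After writing $\mathbf{A}_k = \sum_j |e_j\rangle\langle e_j| \otimes A_{jk}$, $\mathbf{X} = \sum_j |e_j\rangle\langle e_j| \otimes X_j$, this reads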

= & ® Tnl,)^^/ ® Th^F^x, 

so that, by Theorem [TDJ there are elements $B_k$ and $D \in (M_m \otimes M_d)^+$, such that $[(I \otimes B_k), D] = 0$ and $\mathbf{A}_k = (I \otimes B_k) D$. As before, one finds $D = \sum_j |e_j\rangle\langle e_j| \otimes D_j$ for some $D_j \in M_d^+$ which implies (ii). QED

Theorem 16 The function $\widehat{\Psi}_{(p,1)}(A_{12})$ is convex in $A_{12}$ for $p \in (0, 2)$. Moreover, if we let $A_{123}$ denote the block diagonal matrix with blocks $\mathcal{W}_n A \mathcal{W}_n^*$, the following are equivalent

(i) $J_p(I, A_{123}, A_1 \otimes I_{23}) = \sum_j J_p(I, (A_{123})_j, (A_1)_j \otimes I_{23})$ with $A_{123}$ defined by (61).

(ii) There are matrices $D_j \in M_{d_1}^+$, $D = \sum_j D_j$, such that $[A_j, D_j \otimes I] = 0$ and $A_j = A(D^{-1} D_j \otimes I)$.

Proof: It follows from the definition of $A_{123}$ that $d_2^{\frac{1+p}{p}} \Psi_{(p,1)}(A) = \Phi_{(p,1)}(A_{123})$. The equivalence (i) $\Leftrightarrow$ (iii) follows immediately from Theorem 15 and (i) $\Leftrightarrow$ (ii) can be shown to follow from Corollary 14. QED

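Theorem 17 The following monotonicity inequalities hold,

$$ \widehat{\Psi}_{(p,1)}(A_{23}) \leq \widehat{\Psi}_{(p,1)}(A_{123}), \quad p \in (0, 2) \tag{66a} $$
$$ \Psi_{(p,1)}(A_{23}) \geq \Psi_{(p,1)}(A_{123}), \quad p \in (0, 1) \tag{66b} $$
$$ \Psi_{(p,1)}(A_{23}) \leq \Psi_{(p,1)}(A_{123}), \quad p \in [1, 2) \tag{66c} $$

Moreover, equality holds if and only if the conditions of Corollary 13 are satisfied.

Proof: It suffices to give the proof for $\widehat{\Psi}$ since the other inequalities follow immediately. The argument is similar to that for Theorem 7. Let $W_n$ denote the generalized Pauli matrices of Section 2.3 but now let $\mathcal{W}_n = W_n \otimes I_{23}$. Then the convexity of $\widehat{\Psi}_{(p,1)}(A_{23})$ implies

$$ \widehat{\Psi}_{(p,1)}(A_{23}) = \tfrac{1}{d_1} \widehat{\Psi}_{(p,1)}(I_1 \otimes A_{23}) $$
$$ = \tfrac{1}{d_1} \widehat{\Psi}_{(p,1)}\Big( \tfrac{1}{d_1} \sum_n \mathcal{W}_n A_{123} \mathcal{W}_n^* \Big) $$
$$ \leq \tfrac{1}{d_1^2} \sum_n \widehat{\Psi}_{(p,1)}(\mathcal{W}_n A_{123} \mathcal{W}_n^*) = \widehat{\Psi}_{(p,1)}(A_{123}) $$

where we used the invariance of $\widehat{\Psi}$ under unitaries of the form $U_1 \otimes I_{23}$. In the case $p = 1$, it follows from (60) that $\widehat{\Psi}_{(1,1)}(A_{23}) \leq \widehat{\Psi}_{(1,1)}(A_{123})$ becomes

$$ S(A_2) - S(A_{23}) \leq S(A_{12}) - S(A_{123}) \tag{67} $$

which is SSA. Because the equality conditions in Theorem 16 are independent of $p$, they are identical to those for SSA, which are given in Corollary 13. QED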

The Carlen-Lieb triple Minkowski inequality for the case $q = 1$ is an immediate corollary of Theorem 17. Observe that

$$ \mathrm{Tr}_3 \mathrm{Tr}_1 (\mathrm{Tr}_2 A_{123}^p)^{1/p} = \Psi_{(p,1)}(A_{(13),(2)}) \tag{68a} $$
$$ \mathrm{Tr}_3 [\mathrm{Tr}_2 (\mathrm{Tr}_1 A_{123})^p]^{1/p} = \Psi_{(p,1)}(A_{32}) \tag{68b} $$

so that it follows immediately from (66c) that

$$ \mathrm{Tr}_3 [\mathrm{Tr}_2 (\mathrm{Tr}_1 A_{123})^p]^{1/p} = \Psi_{(p,1)}(A_{32}) \leq \Psi_{(p,1)}(A_{132}) = \mathrm{Tr}_3 \mathrm{Tr}_1 (\mathrm{Tr}_2 A_{123}^p)^{1/p} \tag{69} $$

for $1 < p < 2$ and from (66b) that the inequality reverses for $0 < p < 1$. Moreover, the conditions for equality are again independent of $p$ and identical to those for equality in SSA, given in Corollary 13.
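For intuition only, the sketch below checks (69) in the classical (commuting) case, where $A_{123}$ is diagonal with nonnegative entries `a[i][j][k]` and the inequality reduces to the ordinary Minkowski inequality for $\ell^p$ norms. The array shape, the seed and the helper names are assumptions made for illustration; the sketch says nothing about the genuinely non-commutative case treated in the text.

```python
import random


def lhs(a, p):
    """Classical analogue of Tr_3 [Tr_2 (Tr_1 A_123)^p]^(1/p) for entries a[i][j][k] >= 0."""
    n1, n2, n3 = len(a), len(a[0]), len(a[0][0])
    return sum(
        sum(sum(a[i][j][k] for i in range(n1)) ** p for j in range(n2)) ** (1 / p)
        for k in range(n3)
    )


def rhs(a, p):
    """Classical analogue of Tr_3 Tr_1 (Tr_2 A_123^p)^(1/p)."""
    n1, n2, n3 = len(a), len(a[0]), len(a[0][0])
    return sum(
        sum(a[i][j][k] ** p for j in range(n2)) ** (1 / p)
        for k in range(n3)
        for i in range(n1)
    )


rng = random.Random(0)
# Indices: i labels space 1, j labels space 2, k labels space 3.
a = [[[rng.random() for _ in range(3)] for _ in range(4)] for _ in range(2)]

for p in (0.3, 0.7, 1.2, 1.9):
    print(p, lhs(a, p), rhs(a, p))
```

For $p > 1$ the left side should not exceed the right side, and for $0 < p < 1$ the order should reverse, mirroring the two regimes described above.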






5 Final remarks 



It should be clear that the results in Section 2 are not restricted to $J_p(K, A, B)$. The function $g_p(x)$ given in (6) can be replaced by any operator convex function of the form $g(x) = x f(x)$ with $f$ operator monotone on $(0, \infty)$. Moreover, if the measure $\nu(t)$ in (17) is supported on $(0, \infty)$, then the conditions for equality are identical to those in Section 3.

In particular, our results go through with $g_p$ replaced by $\tilde{g}_p$ and $J_p(I, A, B)$ replaced by $\tilde{J}_p(I, A, B)$, which is well-defined for $p \in [-1, 1)$ with $\tilde{J}_0(I, A, B) = H(B, A)$. Thus our results can be extended to all $p \in (-1, 2)$. The case $p = 2$ reduces to the convexity of $(A, X) \mapsto \mathrm{Tr}\, X^* A^{-1} X$ with $A > 0$ proved in [23]. One can show that equality holds if and only if $X_j = A_j T$ for all $j$ with $T = A^{-1} X$.

There have been various attempts, e.g., the Renyi [3J] and Tsallis [35] entropies, to generalize quantum entropy in a way that gives the usual von Neumann entropy at $p = 1$. In this paper we have considered two extensions of the conditional entropy involving an exponent $p \in (0, 2)$, namely,

• $J_p(I, A_{12}, A_1)$ which gives $\mathrm{Tr}\, A_{23}^p A_2^{1-p} \leq \mathrm{Tr}\, A_{123}^p A_{12}^{1-p}$ for $p \in (1, 2)$ and can be thought of as a pseudo-metric; and

• $\Psi_{(p,1)}(A_{12})$ which gives $\mathrm{Tr}_2 (\mathrm{Tr}_3 A_{23}^p)^{1/p} \leq \mathrm{Tr}_{12} (\mathrm{Tr}_3 A_{123}^p)^{1/p}$ for $p \in (1, 2)$ and can be thought of as a pseudo-norm.

These expressions are quite different for $p \neq 1$, but arise from quantities with the same convexity and monotonicity properties, as well as the same equality conditions which are independent of $p$. Moreover, both yield SSA at $p = 1$ and the equality conditions for $p \neq 1$ are identical to those for SSA. This independence of non-trivial equality conditions on the precise form of the function seems remarkable.

If one uses $\tilde{g}_p$ and $\tilde{J}_p(I, A, B)$ from (10), then the inequalities above hold with $p \in (1, 2)$ replaced by $p \in (-1, 0)$ and SSA corresponds to $p = 0$.






A Proof of the key Schwarz inequality 



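For completeness, we include the proof of the joint convexity of $(A, B, X) \mapsto \mathrm{Tr}\, X^* (L_A + t R_B)^{-1}(X)$ when $A, B > 0$ and $t > 0$. Since this function is homogeneous of degree one, it suffices to prove subadditivity. Now let

$$ M_j = (L_{A_j} + t R_{B_j})^{-1/2}(X_j) - (L_{A_j} + t R_{B_j})^{1/2}(\Lambda). \tag{70} $$

Then one can verify that

$$ 0 \leq \sum_j \mathrm{Tr}\, M_j^* M_j = \sum_j \langle M_j, M_j \rangle $$
$$ = \sum_j \mathrm{Tr}\, X_j^* (L_{A_j} + t R_{B_j})^{-1}(X_j) - \mathrm{Tr}\, \Big( \sum_j X_j^* \Big) \Lambda \tag{71} $$
$$ \quad - \mathrm{Tr}\, \Lambda^* \Big( \sum_j X_j \Big) + \mathrm{Tr}\, \Lambda^* \sum_j (L_{A_j} + t R_{B_j}) \Lambda. $$

Next, observe that for any matrix $W$,

$$ \sum_j (L_{A_j} + t R_{B_j})(W) = \sum_j (A_j W + t W B_j) = L_{\sum_j A_j}(W) + t R_{\sum_j B_j}(W). $$

Therefore, inserting the choice $\Lambda = (L_{\sum_j A_j} + t R_{\sum_j B_j})^{-1} (\sum_j X_j)$ in (71) yields

$$ \mathrm{Tr}\, \Big( \sum_j X_j \Big)^* \big( L_{\sum_j A_j} + t R_{\sum_j B_j} \big)^{-1} \Big( \sum_j X_j \Big) \leq \sum_j \mathrm{Tr}\, X_j^* (L_{A_j} + t R_{B_j})^{-1}(X_j) \tag{72} $$

for any $t > 0$. QED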
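The argument above can be illustrated in the scalar case, where $\mathrm{Tr}\, X^*(L_A + tR_B)^{-1}(X)$ becomes $x^2/(a + tb)$ and (72) is an instance of the Cauchy-Schwarz inequality. The following sketch (function names and sample data are illustrative assumptions) evaluates the gap between the two sides of (72) and also builds a case with $M_j = 0$ for all $j$, where the gap vanishes.

```python
import random


def term(x, a, b, t):
    """Scalar analogue of Tr X* (L_A + t R_B)^(-1)(X), namely x^2 / (a + t b)."""
    return x * x / (a + t * b)


def subadditivity_gap(xs, as_, bs, t):
    """Right side minus left side of the scalar version of (72); expected to be >= 0."""
    total = term(sum(xs), sum(as_), sum(bs), t)
    return sum(term(x, a, b, t) for x, a, b in zip(xs, as_, bs)) - total


def equality_case(as_, bs, t, lam=0.37):
    """Choose x_j = lam * (a_j + t b_j), the scalar form of M_j = 0 in (70)."""
    return [lam * (a + t * b) for a, b in zip(as_, bs)]


rng = random.Random(0)
as_ = [rng.uniform(0.1, 2.0) for _ in range(3)]
bs = [rng.uniform(0.1, 2.0) for _ in range(3)]
xs = [rng.uniform(-2.0, 2.0) for _ in range(3)]

print(subadditivity_gap(xs, as_, bs, t=0.7))
print(subadditivity_gap(equality_case(as_, bs, 0.7), as_, bs, t=0.7))
```

The second printed value should be zero up to rounding, which matches the role that $M_j = 0$ plays in the equality analysis.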
Abstract

We consider a generalization of relative entropy derived from the Wigner-Yanase-Dyson entropy and give a simple, self-contained proof that it is convex. Moreover, special cases yield the joint convexity of relative entropy, and for $\mathrm{Tr}\, K^* A^p K B^{1-p}$ Lieb's joint concavity in $(A, B)$ for $0 < p < 1$ and Ando's joint convexity for $1 < p < 2$. This approach allows us to obtain conditions for equality in these cases, as well as conditions for equality in a number of inequalities which follow from them. These include the monotonicity under partial traces, and some Minkowski type matrix inequalities proved by Carlen and Lieb for $\mathrm{Tr}_1 (\mathrm{Tr}_2 A_{12}^p)^{1/p}$. In all cases the equality conditions are independent of $p$; for extensions to three spaces they are identical to the conditions for equality in the strong subadditivity of relative entropy.
1 Introduction

1.1 Background

For matrices $A_{12} > 0$ acting on a tensor product of two Hilbert spaces, Carlen and Lieb [7, 8] considered the trace function $\mathrm{Tr}_1 (\mathrm{Tr}_2 A_{12}^p)^{q/p}$ and proved that it is concave when $0 < p \leq q \leq 1$ and convex when $1 \leq q$ and $1 \leq p \leq 2$. They showed that this implies that these functions and the norms they generate satisfy Minkowski type inequalities, including a natural generalization to matrices $A_{123}$ acting on a tensor product of three Hilbert spaces. They also raised the question of the conditions for equality in their inequalities. When $q = 1$, we show that this can be treated using methods developed to treat equality in the strong subadditivity of quantum entropy. Moreover, we obtain conditions for equality in a large class of related convexity inequalities, show that they are independent of $p$ in the range $0 < p < 2$, and show that for inequalities involving $A_{123}$ they are identical to the equality conditions for strong subadditivity (SSA) of quantum entropy given in [13].

These equality conditions are non-trivial and have found many applications in 
quantum information theory. For example, they play an important role in some 
recent "no broadcasting" results; see [TD] and references therein. They also play a 
key role in Devetak and Yard's [H] "quantum state redistribution" protocol which 
gives an operational interpretation to the quantum conditional mutual information. 

Our approach to proving joint convexity of relative entropy is motivated by 
Araki's relative modular operator [5], introduced to generalize relative entropy to 
more general situations including type III von Neumann algebras. It was subsequently used by Narnhofer and Thirring [29] to give a new proof of SSA. The argument given here is similar to that in [HI [31] [37]; however, the unified treatment for $0 < p < 2$, leading to equality conditions, is new. Moreover, a dual treatment can be given for $-1 < p < 1$ allowing extension to the full range $(-1, 2)$.

Wigner and Yanase [121 [33] introduced the notion of skew information of a density matrix $\gamma$ with respect to a self-adjoint observable $K$,

$$ -\tfrac{1}{2} \mathrm{Tr}\, [K, \gamma^p][K, \gamma^{1-p}] \tag{1} $$

for $p = \tfrac{1}{2}$ and Dyson suggested extending this to $p \in (0, 1)$. Wigner and Yanase [13] proved that (1) is convex in $\gamma$ for $p = \tfrac{1}{2}$ and, in his seminal paper [20] on convex trace functions, Lieb proved joint concavity for $p \in (0, 1)$ for the more general function

$$ (A, B) \mapsto \mathrm{Tr}\, K^* A^p K B^{1-p} \tag{2} $$

for $K$ fixed and $A, B \geq 0$ positive semi-definite. This implies convexity of (1) and was a key step in the original proof [23] of the strong subadditivity (SSA) inequality of quantum entropy. Moreover, it leads to a proof of joint convexity of relative entropy¹ as well. It is less well known that Ando [3, 4] gave another proof which also showed that for $1 < p < 2$, the function (2) is jointly convex in $A, B$. The case $p = 2$ was considered earlier by Lieb and Ruskai [24]. We modify what one might describe as Lieb's extension of the Wigner-Yanase-Dyson (WYD) entropy to a type of relative entropy in a way that allows a unified treatment of the convexity and concavity of $\mathrm{Tr}\, K^* A^p K B^{1-p}$ in the range $p \in (0, 2]$ and includes the usual relative entropy as a special case. Our modification retains a linear term, even for $A \neq B$. Although this might seem unnecessary for convexity and concavity questions, it is crucial to a unified treatment.

Lieb also considered $\mathrm{Tr}\, K^* A^p K B^q$ with $p, q > 0$ and $0 < p + q \leq 1$ and Ando considered $1 < q \leq p \leq 2$. In Section 2.2, we extend our results to this situation. However, we also show that for $q \neq 1 - p$, equality holds only under trivial conditions. Therefore, we concentrate on the case $q = 1 - p$.

Next, we introduce our notation and conventions. In Section 2, we first describe our generalization of relative entropy and prove its convexity; then consider the extension to $q \neq 1 - p$ mentioned above; and finally prove monotonicity under partial traces including a generalization of strong subadditivity to $p \neq 1$. In Section 3 we consider several formulations of equality conditions. In Section 4 we show how to use these results to obtain equality conditions in the results of Lieb and Carlen [7, 8]. For completeness, we include an appendix which contains the proof of a basic convexity result from [37] that is key to our results.



1.2 Notation and conventions 

We introduce two linear maps on the space of $d \times d$ matrices. Left multiplication by $A$ is denoted $L_A$ and defined as $L_A(X) = AX$; right multiplication by $B$ is denoted $R_B$ and defined as $R_B(X) = XB$. These maps are associated with the relative modular operator $\Delta_{AB} = L_A R_B^{-1}$ introduced by Araki in a far more general context. They have the following properties:

a) The operators $L_A$ and $R_B$ commute since

$$ L_A[R_B(X)] = AXB = R_B[L_A(X)] \tag{3} $$

even when $A$ and $B$ do not commute.

¹ In [23] only concavity of the conditional entropy was proved explicitly, but the same argument [36, Section V.B] yields joint convexity of the relative entropy. Independently, Lindblad [26] observed that this follows directly from (2) by differentiating at $p = 1$.
b) $L_A$ and $R_A$ are invertible if and only if $A$ is non-singular, in which case $L_A^{-1} = L_{A^{-1}}$ and $R_A^{-1} = R_{A^{-1}}$.

c) When $A$ is self-adjoint, $L_A$ and $R_A$ are both self-adjoint with respect to the Hilbert-Schmidt inner product $\langle A, B \rangle = \mathrm{Tr}\, A^* B$.

d) When $A \geq 0$, the operators $L_A$ and $R_A$ are positive semi-definite, i.e.,

$$ \mathrm{Tr}\, X^* L_A(X) = \mathrm{Tr}\, X^* A X \geq 0 \quad \text{and} \quad \mathrm{Tr}\, X^* R_A(X) = \mathrm{Tr}\, X^* X A = \mathrm{Tr}\, X A X^* \geq 0. $$

e) When $A > 0$, then $(L_A)^p = L_{A^p}$ and $(R_A)^p = R_{A^p}$ for all $p \geq 0$. If $A$ is also non-singular, this extends to all $p \in \mathbf{R}$. More generally $f(L_A) = L_{f(A)}$ for $f : (0, \infty) \to \mathbf{R}$.

To see why (e) holds, it suffices to observe that $A > 0$ implies $L_A$ and $R_A$ are linear operators for which $f(A)$ can be defined by the spectral theorem for any function $f$ with domain in $(0, \infty)$. It is easy to verify that $A|\phi_j\rangle = \alpha_j|\phi_j\rangle$ implies $L_A |\phi_j\rangle\langle\phi_k| = \alpha_j |\phi_j\rangle\langle\phi_k|$ for $k = 1 \ldots d$ so that the spectral decomposition of $A$ induces one on $L_A$ with degeneracy $d$ and $f(L_A)|\phi_j\rangle\langle\phi_k| = f(\alpha_j)|\phi_j\rangle\langle\phi_k|$. For $R_B$ a similar argument goes through starting with left eigenvectors of $B$, i.e., $\langle\phi_j| B = \beta_j \langle\phi_j|$.

If a function is homogeneous of degree 1, then convexity is equivalent to subadditivity. Thus, if $F(\lambda A) = \lambda F(A)$, then $F$ is convex if and only if $F(A) \leq \sum_j F(A_j)$ with $A = \sum_j A_j$. We will use this equivalence without further ado.

For $B$ positive semi-definite, we denote the projection onto $(\ker B)^\perp$ by $P_{(\ker B)^\perp}$. We will encounter expressions involving commuting positive semi-definite matrices $A, D$ with $\ker D \subseteq \ker A$. We will simply write $A D^{-1}$ for

$$ \lim_{\epsilon \to 0} \sqrt{A} (D + \epsilon I)^{-1} \sqrt{A} = A D^{-1} P_{(\ker D)^\perp} = A D^{-1} P_{(\ker A)^\perp} \tag{4} $$

with $D^{-1}$ the generalized inverse.

2 WYD entropy revisited and extended 

2.1 Generalization of relative entropy 

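We now introduce the family of functions

$$ g_p(x) = \begin{cases} \frac{1}{p(1-p)} (x - x^p) & p \neq 1 \\ x \log x & p = 1 \end{cases} \tag{5} $$

which are well-defined for $x > 0$ and $p \neq 0$. We will consider $p \in (0, 2]$ although it would suffice to consider $p \in [\tfrac{1}{2}, 2]$. For $A, B$ strictly positive we define

$$ J_p(K, A, B) = \mathrm{Tr}\, \sqrt{B} K^* g_p(L_A R_B^{-1}) (K \sqrt{B}) \tag{6} $$
$$ = \begin{cases} \frac{1}{p(1-p)} (\mathrm{Tr}\, K^* A K - \mathrm{Tr}\, K^* A^p K B^{1-p}) & p \in (0, 1) \cup (1, 2) \\ \mathrm{Tr}\, K K^* A \log A - \mathrm{Tr}\, K^* A K \log B & p = 1 \\ -\tfrac{1}{2} (\mathrm{Tr}\, K^* A K - \mathrm{Tr}\, K^* A^2 K B^{-1}) & p = 2 \end{cases} \tag{7} $$

When $p = 1$ and $K = I$, (7) reduces to the usual relative entropy, i.e.,

$$ J_1(I, A, B) = H(A, B) = \mathrm{Tr}\, A (\log A - \log B) \tag{8} $$

For $p \neq 1$, the function $J_p(K, A, B)$ differs from that considered by Lieb [20] and Ando [3, 4] by the seemingly irrelevant linear term $\mathrm{Tr}\, K^* A K$ and the factor $\frac{1}{p(1-p)}$. However, this minor difference allows us to give a unified treatment of $p \in (0, 2]$ because of the extension by continuity to $p = 1$ and the sign change there.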
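The extension by continuity to $p = 1$ can be checked numerically. The sketch below is an illustration under stated assumptions: it restricts to $K = I$ and real symmetric 2x2 positive definite matrices, and the helper `fun2` (a closed-form spectral calculus for such matrices) and the sample matrices are hypothetical names and data introduced here. It compares $J_p(I, A, B)$ for $p$ very close to 1 with the relative entropy $\mathrm{Tr}\, A(\log A - \log B)$.

```python
import math


def fun2(M, f):
    """f(M) for a real symmetric 2x2 matrix M, computed from its spectral projections."""
    a, b, d = M[0][0], M[0][1], M[1][1]
    mean, half = (a + d) / 2, math.hypot((a - d) / 2, b)
    f1, f2 = f(mean + half), f(mean - half)
    if half == 0:  # M is a multiple of the identity
        return [[f1, 0.0], [0.0, f1]]
    # P1 = (M - l2 I) / (l1 - l2) projects onto the top eigenvector, and f(M) = f2 I + (f1 - f2) P1
    p11, p12, p22 = (a - mean + half) / (2 * half), b / (2 * half), (d - mean + half) / (2 * half)
    return [[f2 + (f1 - f2) * p11, (f1 - f2) * p12],
            [(f1 - f2) * p12, f2 + (f1 - f2) * p22]]


def tr_prod(A, B):
    """Tr(A B) for 2x2 matrices."""
    return sum(A[i][j] * B[j][i] for i in range(2) for j in range(2))


def J(p, A, B):
    """J_p(I, A, B) as in (7), for positive definite A and B."""
    if p == 1:
        return tr_prod(A, fun2(A, math.log)) - tr_prod(A, fun2(B, math.log))
    Ap = fun2(A, lambda x: x ** p)
    Bq = fun2(B, lambda x: x ** (1 - p))
    return (A[0][0] + A[1][1] - tr_prod(Ap, Bq)) / (p * (1 - p))


A = [[0.6, 0.2], [0.2, 0.4]]    # non-commuting density matrices (trace one)
B = [[0.5, -0.1], [-0.1, 0.5]]

for p in (1 - 1e-5, 1, 1 + 1e-5):
    print(p, J(p, A, B))
```

The three printed values should agree to several digits, which is the numerical counterpart of the continuity at $p = 1$ noted above.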

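One might expect to associate the exchange $A \leftrightarrow B$ with the symmetry $p \leftrightarrow (1 - p)$ around $p = \tfrac{1}{2}$. However, there are several subtleties due to the linear term, the exchange $K \leftrightarrow K^*$, and the case $p = 1$. Therefore, we use instead the observation that

$$ J_p(K^*, B, A) = \mathrm{Tr}\, \sqrt{A} K g_p(L_B R_A^{-1}) (K^* \sqrt{A}) = \mathrm{Tr}\, \sqrt{B} K^* \tilde{g}_{1-p}(L_A R_B^{-1}) (K \sqrt{B}) = \tilde{J}_{1-p}(K, A, B) \tag{9} $$

where, for $-1 \leq p < 1$, we define

$$ \tilde{g}_p(x) = x\, g_{1-p}(x^{-1}) = \begin{cases} \frac{1}{p(1-p)} (1 - x^p) & p \neq 0 \\ -\log x & p = 0 \end{cases} \tag{10} $$

and $\tilde{J}_p(K, A, B) = \mathrm{Tr}\, \sqrt{B} K^* \tilde{g}_p(L_A R_B^{-1}) (K \sqrt{B})$.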

The functions $J_p(K, A, B)$ and $\tilde{J}_p(K, A, B)$ have been considered before, usually with $K = I$, in the context of information geometry ([2, Section 7.2] and references therein) and by Petz [31] who used the term "quasi-entropy". What is novel here is that we present a simple unified proof of joint convexity in $A, B$ that easily yields equality conditions, shows that they are independent of $p$, and can be extended to other functions.

The special case $J_p(I, A, I)$ is equivalent to the Tsallis [30] entropy². When $K = K^*$, the relation

$$ J_p(K, A, A) = -\frac{1}{2p(1-p)} \mathrm{Tr}\, [K, A^p][K, A^{1-p}] \tag{11} $$

² This was pointed out by Karol Zyczkowski

yields the original WYD information (up to a constant) and extends it to the range $(0, 2]$. Moreover, $K = K^*$ implies that $J_p(K, A, A) = J_{1-p}(K, A, A)$. Although neither $g_p(w)$ nor $\tilde{g}_{1-p}(w)$ is positive, their average³

$$ G_p(w) = \tfrac{1}{2} [g_p(w) + w\, g_p(w^{-1})] \geq 0 \quad \text{on } (0, \infty). $$

Therefore, when $K = K^*$,

$$ J_p(K, A, A) = \mathrm{Tr}\, (K \sqrt{A})^* G_p(L_A R_A^{-1}) (K \sqrt{A}) \geq 0 \tag{12} $$

The function $J_p(I, A, B)$ is a more appealing generalization of relative entropy than $\mathrm{Tr}\, A^p B^{1-p}$ because of Proposition 1 which one can consider to be a generalization of Klein's inequality [17]. It allows one to use $J_p(I, A, B)$ as a pseudo-metric, as is commonly done with the relative entropy.

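Proposition 1 When $U$ is unitary and $A, B > 0$ with $\mathrm{Tr}\, A = \mathrm{Tr}\, B = 1$, then $J_p(U, A, B) \geq 0$ with equality if and only if $A = U B U^*$.

Proof: When $U$ is unitary,

$$ J_p(U, A, B) = J_p(I, U^* A U, B) = J_p(I, A, U B U^*). \tag{13} $$

Therefore, it suffices to consider the case $U = I$. For $p \in (0, 1)$ Hölder's inequality implies $\mathrm{Tr}\, A^p B^{1-p} \leq (\mathrm{Tr}\, A)^p (\mathrm{Tr}\, B)^{1-p} = 1$ with equality if and only if $A = B$. It immediately follows that

$$ J_p(I, A, B) \geq \frac{1}{p(1-p)} (\mathrm{Tr}\, A - 1) = 0 \quad \text{and} \quad J_p(I, A, B) = 0 \Leftrightarrow A = B. \tag{14} $$

For $p = 1$, the result is well-known [38, Section 2.5.2] and originally due to O. Klein [17]. For $p \in (1, 2)$ we write $p = 1 + r$ and again use Hölder's inequality

$$ 1 = \mathrm{Tr}\, A = \mathrm{Tr}\, B^{-\frac{r}{2(1+r)}} A B^{-\frac{r}{2(1+r)}} B^{\frac{r}{1+r}} $$
$$ \leq \Big[ \mathrm{Tr}\, \big( B^{-\frac{r}{2(1+r)}} A B^{-\frac{r}{2(1+r)}} \big)^{1+r} \Big]^{\frac{1}{1+r}} (\mathrm{Tr}\, B)^{\frac{r}{1+r}} \tag{15} $$
$$ \leq \big[ \mathrm{Tr}\, B^{-\frac{r}{2}} A^{1+r} B^{-\frac{r}{2}} \big]^{\frac{1}{1+r}} = (\mathrm{Tr}\, A^{1+r} B^{-r})^{\frac{1}{1+r}} $$

where we used $\mathrm{Tr}\, B = 1$ and the second inequality follows from a classic result of Lieb-Thirring [25, Appendix B, Theorem 9]. QED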
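A small numerical illustration of Proposition 1 with $U = I$ follows. It is a sketch under assumptions: only real symmetric 2x2 density matrices are treated, and the helper `fun2` and the test matrices are hypothetical choices made here rather than anything taken from the paper. It evaluates $J_p(I, A, B)$ for several $p \in (0, 2)$ and checks that the value vanishes when the two arguments coincide.

```python
import math


def fun2(M, f):
    """f(M) for a real symmetric 2x2 matrix M, computed from its spectral projections."""
    a, b, d = M[0][0], M[0][1], M[1][1]
    mean, half = (a + d) / 2, math.hypot((a - d) / 2, b)
    f1, f2 = f(mean + half), f(mean - half)
    if half == 0:  # M is a multiple of the identity
        return [[f1, 0.0], [0.0, f1]]
    p11, p12, p22 = (a - mean + half) / (2 * half), b / (2 * half), (d - mean + half) / (2 * half)
    return [[f2 + (f1 - f2) * p11, (f1 - f2) * p12],
            [(f1 - f2) * p12, f2 + (f1 - f2) * p22]]


def tr_prod(A, B):
    """Tr(A B) for 2x2 matrices."""
    return sum(A[i][j] * B[j][i] for i in range(2) for j in range(2))


def J(p, A, B):
    """J_p(I, A, B) for positive definite A and B, using (7)."""
    if p == 1:
        return tr_prod(A, fun2(A, math.log)) - tr_prod(A, fun2(B, math.log))
    Ap = fun2(A, lambda x: x ** p)
    Bq = fun2(B, lambda x: x ** (1 - p))
    return (A[0][0] + A[1][1] - tr_prod(Ap, Bq)) / (p * (1 - p))


A = [[0.6, 0.2], [0.2, 0.4]]    # density matrices: positive definite, trace one
B = [[0.5, -0.1], [-0.1, 0.5]]
EXPONENTS = (0.2, 0.5, 0.8, 1, 1.3, 1.7)

for p in EXPONENTS:
    print(p, J(p, A, B), J(p, A, A))
```

Each first value should be strictly positive because $A \neq B$, and each second value should be zero up to rounding.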

Because the denominator $p(1-p)$ changes sign at $p = 0$ and $p = 1$, both $g_p$ and $\tilde{g}_p$ are convex. In fact, they satisfy the much stronger condition of operator convexity for $p \in (0, 2]$ and $p \in [-1, 1)$ respectively. Since $g_p(0) = 0$ and

$$ \frac{g_p(x)}{x} = \begin{cases} \frac{1}{p(1-p)} (1 - x^{p-1}) & p \neq 1 \\ \log x & p = 1 \end{cases} \tag{16} $$

³ The definition of $\tilde{g}_p$ in (10) differs from that in [18] by the exchange $\tilde{g}_p \leftrightarrow \tilde{g}_{1-p}$ so that in [18] $G(w) = \tfrac{1}{2}[g(w) + \tilde{g}(w)]$ for any $g$. In the convention used here, $G_p(w) = \tfrac{1}{2}[g_p(w) + \tilde{g}_{1-p}(w)]$.

it follows that $g_p(x)/x$ is operator monotone [31 QUI for $p \in (0, 2]$, i.e., $g_p(x)/x$ can be analytically continued to the upper half plane, which it maps into itself. By applying Nevanlinna's theorem [H Section 59, Theorem 2] to $g_p(x)/x$, one finds that $g_p(x)$ has an integral representation of the form

$$ g_p(x) = ax + \int_0^\infty \frac{x^2 t - x}{x + t} \, d\nu(t) = ax + \int_0^\infty \Big[ \frac{x^2}{x + t} - \frac{1}{t} + \frac{1}{x + t} \Big] t \, d\nu(t) \tag{17} $$

with $\nu(t) \geq 0$. Integral representations are not unique, and making a suitable change of variable in the classic formula

$$ \int_0^\infty \frac{x^{p-1}}{x + 1} \, dx = \frac{\pi}{\sin p\pi} \equiv \frac{1}{c_p}, \quad p \in (0, 1) \tag{18} $$

allows us to give the following explicit representations

$$ g_p(x) = \begin{cases} \dfrac{1}{p(1-p)} \Big[ x + c_p \displaystyle\int_0^\infty \Big( \frac{t}{x + t} - 1 \Big) t^{p-1} \, dt \Big] & p \in (0, 1) \\[2ex] \displaystyle\int_0^\infty \Big( \frac{x^2}{x + t} - 1 + \frac{t}{x + t} \Big) \frac{1}{1 + t} \, dt & p = 1 \\[2ex] \dfrac{1}{p(1-p)} \Big[ x - c_{p-1} \displaystyle\int_0^\infty \frac{x^2}{x + t} \, t^{p-2} \, dt \Big] & p \in (1, 2) \end{cases} \tag{19} $$

Note that for $p \in (0, 2)$ the integrand is supported on $(0, \infty)$. This plays a key role in the equality conditions; therefore, we will henceforth concentrate on $p \in (0, 2)$.

Theorem 2 The function $J_p(K, A, B)$ defined in (6) is jointly convex in $A, B$.

Proof: It follows from (17) that

$$ J_p(K, A, B) = a \, \mathrm{Tr}\, K^* A K + \int_0^\infty \Big[ \mathrm{Tr}\, K^* A \frac{1}{L_A + t R_B} (A K) - \frac{1}{t} \mathrm{Tr}\, K B K^* + \mathrm{Tr}\, B K^* \frac{1}{L_A + t R_B} (K B) \Big] t \, \nu(t) \, dt \tag{20} $$

The joint convexity then follows immediately from that of the map $(X, A, B) \mapsto \mathrm{Tr}\, X^* \frac{1}{L_A + t R_B} (X)$ which was proved in [37] following the strategy in [24]. The proof is also given in the Appendix. QED



For other approaches see Petz [30, 31], Effros [11]. The advantage of the argument used here is that it immediately implies that equality holds in joint convexity if and only if it holds for each term in the integrand.



Corollary 3 The relative entropy $H(A, B) = J_1(I, A, B)$ is jointly convex in $A, B$.
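Since $J_p$ is homogeneous of degree one, joint convexity is equivalent to subadditivity, which is easy to probe numerically. The sketch below is illustrative only: it takes $K = I$, uses real symmetric 2x2 positive definite matrices, and relies on a hypothetical helper `fun2` and arbitrarily chosen sample matrices. It evaluates the gap $\sum_j J_p(I, A_j, B_j) - J_p(I, \sum_j A_j, \sum_j B_j)$, which Theorem 2 and Corollary 3 predict to be nonnegative.

```python
import math


def fun2(M, f):
    """f(M) for a real symmetric 2x2 matrix M, computed from its spectral projections."""
    a, b, d = M[0][0], M[0][1], M[1][1]
    mean, half = (a + d) / 2, math.hypot((a - d) / 2, b)
    f1, f2 = f(mean + half), f(mean - half)
    if half == 0:  # M is a multiple of the identity
        return [[f1, 0.0], [0.0, f1]]
    p11, p12, p22 = (a - mean + half) / (2 * half), b / (2 * half), (d - mean + half) / (2 * half)
    return [[f2 + (f1 - f2) * p11, (f1 - f2) * p12],
            [(f1 - f2) * p12, f2 + (f1 - f2) * p22]]


def tr_prod(A, B):
    """Tr(A B) for 2x2 matrices."""
    return sum(A[i][j] * B[j][i] for i in range(2) for j in range(2))


def add(A, B):
    """Entrywise sum of two 2x2 matrices."""
    return [[A[i][j] + B[i][j] for j in range(2)] for i in range(2)]


def J(p, A, B):
    """J_p(I, A, B) for positive definite A and B; p = 1 gives the relative entropy H(A, B)."""
    if p == 1:
        return tr_prod(A, fun2(A, math.log)) - tr_prod(A, fun2(B, math.log))
    Ap = fun2(A, lambda x: x ** p)
    Bq = fun2(B, lambda x: x ** (1 - p))
    return (A[0][0] + A[1][1] - tr_prod(Ap, Bq)) / (p * (1 - p))


def convexity_gap(p, A1, B1, A2, B2):
    """J_p(A1, B1) + J_p(A2, B2) - J_p(A1 + A2, B1 + B2); expected to be >= 0."""
    return J(p, A1, B1) + J(p, A2, B2) - J(p, add(A1, A2), add(B1, B2))


A1, A2 = [[2.0, 1.0], [1.0, 1.0]], [[1.0, -0.5], [-0.5, 3.0]]
B1, B2 = [[1.0, 0.3], [0.3, 2.0]], [[3.0, 1.0], [1.0, 1.0]]

for p in (0.3, 0.7, 1, 1.4, 1.8):
    print(p, convexity_gap(p, A1, B1, A2, B2))
```

When the pairs are proportional, for example $A_2 = 2A_1$ and $B_2 = 2B_1$, the gap should vanish by homogeneity.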
2.2 Extensions with $r \neq 1 - p$.

We now consider extensions of Theorem 2 to situations considered by Ando [4] and Lieb [20] in which $B^{1-p}$ is replaced by $B^r$ with $r \neq 1 - p$. Our approach uses an idea from Bekjan [6] and Effros [11]. We will also show that equality holds in these extensions only under trivial conditions. For this we first need an elementary lemma, which we prove for the concave case.

Lemma 4 Let $f(x) : [0, \infty) \to \mathbf{R}$ be a non-linear convex or concave operator function, let $A_1, A_2$ be density matrices and $A = \lambda A_1 + (1 - \lambda) A_2$ with $\lambda \in (0, 1)$. Then $f(A) = \lambda f(A_1) + (1 - \lambda) f(A_2)$ if and only if $A_1 = A_2$.

Proof: Since any operator concave function is analytic, non-linearity implies that $f$ is strictly concave. If $f(A) = \lambda f(A_1) + (1 - \lambda) f(A_2)$, then

$$ \langle v, f(A) v \rangle = \lambda \langle v, f(A_1) v \rangle + (1 - \lambda) \langle v, f(A_2) v \rangle \tag{21} $$

for any vector $v$. Now choose $v$ to be a normalized eigenvector of $A$. Then inserting this on the left above and applying Jensen's inequality to each term on the right, one finds

$$ f(\langle v, A v \rangle) \leq \lambda f(\langle v, A_1 v \rangle) + (1 - \lambda) f(\langle v, A_2 v \rangle) \tag{22} $$

But this contradicts concavity unless equality holds, which implies that $v$ is also an eigenvector of $A_1$ and $A_2$. But then the strict concavity of $f$ also implies that $\langle v, A_1 v \rangle = \langle v, A_2 v \rangle$. Since this holds for an orthonormal basis of eigenvectors of $A$, $A_1$ and $A_2$, we must have $A_1 = A_2$.

Corollary 5 The function $(A, B) \mapsto \mathrm{Tr}\, K^* A^p K B^r$ is jointly concave on the set of positive definite matrices when $p, r > 0$ and $p + r \leq 1$. Moreover, when $p + r < 1$ and $K$ is invertible, the concavity is strict unless $B_1 = B_2$ and $A_1 = A_2$.

Proof: It is an immediate consequence of Theorem 2 that $(A, B) \mapsto \mathrm{Tr}\, K^* A^p K B^{1-p}$ is jointly concave in $A, B$. Now write $\mathrm{Tr}\, K^* A^p K B^r = \mathrm{Tr}\, K^* A^p K (B^s)^{1-p}$ with $s = r/(1 - p)$. First, observe that for $0 < s < 1$ the function $f(x) = x^s$ satisfies the hypotheses of Lemma 4. Therefore,

$$ (\lambda B_1 + (1 - \lambda) B_2)^s > \lambda B_1^s + (1 - \lambda) B_2^s \tag{23} $$

with $0 < \lambda < 1$ and $B_1 \neq B_2$. The operator monotonicity of $x \mapsto x^{1-p}$ for $0 < p < 1$ then implies

$$ (\lambda B_1 + (1 - \lambda) B_2)^r \geq (\lambda B_1^s + (1 - \lambda) B_2^s)^{1-p}, \tag{24} $$

and the joint concavity of $\mathrm{Tr}\, K^* A^p K B^{1-p}$ implies

$$ \mathrm{Tr}\, K^* A^p K (B^s)^{1-p} \geq \mathrm{Tr}\, K^* (\lambda A_1 + (1 - \lambda) A_2)^p K (\lambda B_1^s + (1 - \lambda) B_2^s)^{1-p} \tag{25} $$
$$ \geq \lambda \mathrm{Tr}\, K^* A_1^p K B_1^{s(1-p)} + (1 - \lambda) \mathrm{Tr}\, K^* A_2^p K B_2^{s(1-p)} $$

where $A = \lambda A_1 + (1 - \lambda) A_2$, $B = \lambda B_1 + (1 - \lambda) B_2$, which is precisely the joint concavity of $\mathrm{Tr}\, K^* A^p K B^r$. Moreover, equality in joint concavity implies equality in (25) and, since $K^* A^p K$ is strictly positive, this implies equality in (24). Therefore, equality in (24) gives a contradiction unless $B_1 = B_2$. In that case, the joint concavity reduces to concavity in $A$ for which, by a similar argument, equality holds if and only if $A_1 = A_2$. QED

Corollary 6 The function $(A, B) \mapsto \mathrm{Tr}\, K^* A^p K B^{1-r}$ is jointly convex on the set of positive definite matrices when $1 < r \leq p \leq 2$. Moreover, when $r < p$ and $K$ is invertible, the convexity is strict unless $B_1 = B_2$ and $A_1 = A_2$.

Proof: The argument is similar to that for Corollary 5. Write $\mathrm{Tr}\, K^* A^p K B^{1-r} = \mathrm{Tr}\, K^* A^p K (B^s)^{1-p}$ with $s = \frac{r-1}{p-1}$. Since $s \in (0, 1)$ and $1 - p \in (-1, 0)$ when $1 < r < p < 2$, it follows that $x^s$ is operator concave and $x^{1-p}$ is operator monotone decreasing. QED

2.3 Monotonicity under partial traces 

Let $X$ and $Z$ denote the generalized Pauli operators whose action on the standard basis is $X|e_k\rangle = |e_{k+1}\rangle$ (with subscript addition mod $d$) and $Z|e_k\rangle = e^{i 2\pi k/d}|e_k\rangle$. It is well known and easy to verify that $\frac{1}{d} \sum_k Z^k A Z^{-k}$ is the projection of a matrix onto its diagonal. If $D$ is a diagonal matrix, then $\sum_k X^k D X^{-k} = (\mathrm{Tr}\, D) I$. Now let $\{W_n\}_{n = 1, 2, \ldots, d^2}$ denote some ordering of the generalized Pauli operators, e.g., $W_{j + (k-1)d} = X^j Z^k$ with $j, k = 1, 2 \ldots d$. Then $\frac{1}{d} \sum_n W_n A W_n^* = (\mathrm{Tr}\, A) I$ and

$$ \frac{1}{d_1} \sum_n (W_n \otimes I_2) A_{12} (W_n \otimes I_2)^* = I_1 \otimes (\mathrm{Tr}_1 A_{12}) = I_1 \otimes A_2 \tag{26} $$

Using the fact that replacing $W_n$ by $U W_n U^*$ with $U$ unitary simply corresponds to a change of basis which does not affect (26) and then multiplying both sides by $U^* \otimes I_2$ on the left and $U \otimes I_2$ on the right gives the equivalent expression

$$ \frac{1}{d_1} \sum_n (W_n U^* \otimes I_2) A_{12} (W_n U^* \otimes I_2)^* = I_1 \otimes A_2 \tag{27} $$
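The averaging identity $\frac{1}{d} \sum_n W_n A W_n^* = (\mathrm{Tr}\, A) I$ behind (26) can be checked directly. The sketch below is a minimal illustration with hand-rolled matrix helpers; the function names and the sample matrix are assumptions made here. It lets the exponents run over $0, \ldots, d-1$, which gives the same set of operators as $1, \ldots, d$ because $X^d = Z^d = I$.

```python
import cmath


def matmul(A, B):
    """Product of two square matrices given as nested lists."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)] for i in range(n)]


def dagger(A):
    """Conjugate transpose."""
    n = len(A)
    return [[complex(A[j][i]).conjugate() for j in range(n)] for i in range(n)]


def pauli_operators(d):
    """All d^2 products X^j Z^k of the cyclic shift X and the phase operator Z on C^d."""
    omega = cmath.exp(2j * cmath.pi / d)
    X = [[1.0 if r == (c + 1) % d else 0.0 for c in range(d)] for r in range(d)]  # X|e_c> = |e_{c+1}>
    Z = [[omega ** r if r == c else 0.0 for c in range(d)] for r in range(d)]
    ops = []
    for j in range(d):
        for k in range(d):
            W = [[1.0 if r == c else 0.0 for c in range(d)] for r in range(d)]
            for _ in range(j):
                W = matmul(W, X)
            for _ in range(k):
                W = matmul(W, Z)
            ops.append(W)
    return ops


def twirl(A):
    """(1/d) * sum_n W_n A W_n^*, which should equal (Tr A) times the identity."""
    d = len(A)
    out = [[0j] * d for _ in range(d)]
    for W in pauli_operators(d):
        term = matmul(matmul(W, A), dagger(W))
        for r in range(d):
            for c in range(d):
                out[r][c] += term[r][c] / d
    return out


A = [[1 + 2j, 0.5, -1j], [2.0, 3.0, 0.25j], [0.1, -0.7, -2 + 1j]]
trace_A = sum(A[i][i] for i in range(3))
T = twirl(A)
print(trace_A, T[0][0], T[0][1])
```

The matrix `A` need not be Hermitian for the identity to hold, which is why a generic complex matrix is used.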

Combining this with joint convexity yields a slight generalization of the well-known monotonicity of $J_p(K, A, B)$ under partial traces (MPT), first proved by Lieb in [20] for the case $K_{12} = I_1 \otimes K_2$ when $p \in (0, 1)$.
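Theorem 7 Let $J_p$ be as in (7), $A_{12}, B_{12}$ strictly positive in $M_{d_1} \otimes M_{d_2}$ and $K_{12} = V_1 \otimes K_2$ with $V_1$ unitary in $M_{d_1}$. Then

$$ J_p(K_2, A_2, B_2) \leq J_p(K_{12}, A_{12}, B_{12}) \tag{28} $$

Proof: Writing $\mathcal{W}_n$ for $W_n \otimes I_2$ and using (27) gives

$$ J_p(K_2, A_2, B_2) = \tfrac{1}{d_1} J_p(I_1 \otimes K_2, I_1 \otimes A_2, I_1 \otimes B_2) $$
$$ = \tfrac{1}{d_1} J_p\Big( I_1 \otimes K_2, \tfrac{1}{d_1} \sum_n \mathcal{W}_n V_1^* A_{12} V_1 \mathcal{W}_n^*, \tfrac{1}{d_1} \sum_n \mathcal{W}_n B_{12} \mathcal{W}_n^* \Big) $$
$$ \leq \tfrac{1}{d_1^2} \sum_n J_p\big( I_1 \otimes K_2, \mathcal{W}_n V_1^* A_{12} V_1 \mathcal{W}_n^*, \mathcal{W}_n B_{12} \mathcal{W}_n^* \big) $$
$$ = J_p(V_1 \otimes K_2, A_{12}, B_{12}) $$

where the final equality follows from the unitary invariance of the trace. QED

Because $\mathrm{Tr}_{12} (V_1 \otimes K_2) A_{12} (V_1 \otimes K_2)^* = \mathrm{Tr}_2 K_2 A_2 K_2^*$, (28) is equivalent to

$$ \mathrm{Tr}\, K_2^* A_2^p K_2 B_2^{1-p} - \mathrm{Tr}\, (V_1 \otimes K_2)^* A_{12}^p (V_1 \otimes K_2) B_{12}^{1-p} \; \begin{cases} \geq 0 & p \in (0, 1) \\ \leq 0 & p \in (1, 2) \end{cases} \tag{29} $$

We can obtain a weak reversal of this for $p \in (0, 1)$. The argument in the Appendix shows that for any $p$ and fixed $A, B > 0$ both $\mathrm{Tr}\, K^* A^p K B^{1-p}$ and $\mathrm{Tr}\, K^* A K$ are convex in $K$. This was observed earlier by Lieb [20] and also follows from the results in [23]. One can then apply the argument above in the special case $A_{12} = I_1 \otimes A_2$, $B_{12} = I_1 \otimes B_2$ to conclude that

$$ \mathrm{Tr}\, K_2^* A_2^p K_2 B_2^{1-p} \leq \tfrac{1}{d_1} \mathrm{Tr}\, K_{12}^* (I_1 \otimes A_2)^p K_{12} (I_1 \otimes B_2)^{1-p} \tag{30} $$
$$ \leq \mathrm{Tr}\, K_{12}^* (I_1 \otimes A_2)^p K_{12} (I_1 \otimes B_2)^{1-p} \tag{31} $$

independent of whether $p < 1$ or $p > 1$. However, because the term $\mathrm{Tr}\, K^* A K$ is convex rather than linear in $K$, (30) does not allow us to draw any conclusions about the monotonicity of $J_p(K_{12}, I_1 \otimes A_2, I_1 \otimes B_2)$.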

To prove Theorem 7 we showed that joint convexity implies monotonicity; the reverse implication also holds. Let $A_1, \ldots, A_m, B_1, \ldots, B_m$ be positive definite matrices in $M_d$, $A = \sum_j A_j$, $B = \sum_j B_j$, and put

$$ A_{12} = \sum_j |e_j\rangle\langle e_j| \otimes A_j, \quad B_{12} = \sum_j |e_j\rangle\langle e_j| \otimes B_j, \tag{32} $$

for $e_1, \ldots, e_m$ the standard basis of $\mathbf{C}^m$. Then $A_{12}$ and $B_{12}$ are block diagonal, and $A_2 = \mathrm{Tr}_1 A_{12} = \sum_k A_k = A$ and similarly for $B$. Then if monotonicity under partial traces holds, one can conclude that

$$ J_p(K, A, B) = J_p(K, A_2, B_2) \leq J_p(I_1 \otimes K, A_{12}, B_{12}) = \sum_j J_p(K, A_j, B_j) \tag{33} $$

Thus, monotonicity under partial traces also directly implies joint convexity of $J_p$. Applying (28) in the case $K = I$, and $A_{12} \mapsto A_{123}$ and $B_{12} \mapsto A_{12} \otimes I_3$ gives

$$ J_p(I_{23}, A_{23}, A_2 \otimes I_3) \leq J_p(I_{123}, A_{123}, A_{12} \otimes I_3) \tag{34} $$

When $p = 1$, it follows from (7) that

$$ J_1(I_{23}, A_{23}, A_2 \otimes I_3) = H(A_{23}, A_2 \otimes I_3) = -S(A_{23}) + S(A_2) $$

where $S(A) = -\mathrm{Tr}\, A \log A$. Thus, (34) becomes

$$ -S(A_{23}) + S(A_2) \leq -S(A_{123}) + S(A_{12}) $$

or, equivalently

$$ S(A_2) + S(A_{123}) \leq S(A_{12}) + S(A_{23}) \tag{35} $$

which is the standard form of SSA.
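In the classical (diagonal) case the entropies in (35) become Shannon entropies of marginals of a joint distribution, which makes the inequality easy to test. The sketch below is an illustration only: the dictionary representation, the seed and the helper names are assumptions introduced here. It computes the SSA gap $S(A_{12}) + S(A_{23}) - S(A_2) - S(A_{123})$ for a random joint distribution and for a Markov chain $1 \to 2 \to 3$, where the gap is expected to vanish.

```python
import math
import random


def entropy(probs):
    """Shannon entropy -sum q log q, skipping zero entries."""
    return -sum(q * math.log(q) for q in probs if q > 0)


def marginal(p, keep):
    """Marginal of a joint distribution p[(i, j, k)] on the index positions listed in keep."""
    out = {}
    for idx, q in p.items():
        key = tuple(idx[m] for m in keep)
        out[key] = out.get(key, 0.0) + q
    return out


def ssa_gap(p):
    """S(A_12) + S(A_23) - S(A_2) - S(A_123) for a classical A_123; SSA says this is >= 0."""
    def S(keep):
        return entropy(marginal(p, keep).values())
    return S((0, 1)) + S((1, 2)) - S((1,)) - S((0, 1, 2))


def markov_chain(p2, p1_given_2, p3_given_2):
    """Joint distribution in which systems 1 and 3 are independent given system 2."""
    return {
        (i, j, k): p2[j] * p1_given_2[j][i] * p3_given_2[j][k]
        for j in range(len(p2))
        for i in range(len(p1_given_2[j]))
        for k in range(len(p3_given_2[j]))
    }


rng = random.Random(1)
weights = {(i, j, k): rng.random() for i in range(2) for j in range(3) for k in range(2)}
total = sum(weights.values())
joint = {idx: w / total for idx, w in weights.items()}

chain = markov_chain([0.2, 0.8], [[0.3, 0.7], [0.6, 0.4]], [[0.5, 0.5], [0.1, 0.9]])

print(ssa_gap(joint), ssa_gap(chain))
```

The vanishing gap for the Markov chain is the classical shadow of the equality conditions discussed in the next section.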

3 Equality for joint convexity of $J_p(K, A, B)$.

3.1 Origin of necessary and sufficient conditions

Looking back at the proof of Theorem 2 we see that for $p \in (0, 2)$, equality holds in the joint convexity of $J_p(K, A, B)$ if and only if equality holds in the joint convexity for each term in the integrand in (17). It should be clear from the argument given in the Appendix, that this requires $M_j = 0$ for all $j$ with $M_j$ given by (70). This is easily seen to be equivalent to

$$ (L_{A_j} + t R_{B_j})^{-1}(X_j) = (L_A + t R_B)^{-1}(X) \quad \text{for all } j. \tag{36} $$

with $A = \sum_j A_j$, $B = \sum_j B_j$, and $X = \sum_j X_j$ with $X_j = A_j K$ and/or $X_j = K B_j$. By writing $AK = L_A(K)$ in the former case and $KB = R_B(K)$ in the latter we obtain the conditions

$$ (I + t \Delta_{A_j B_j}^{-1})^{-1}(K) = (I + t \Delta_{AB}^{-1})^{-1}(K) \quad \forall j \;\; \forall t > 0 \tag{37a} $$
$$ (\Delta_{A_j B_j} + t I)^{-1}(K) = (\Delta_{AB} + t I)^{-1}(K) \quad \forall j \;\; \forall t > 0 \tag{37b} $$

From the integral representations (19), one might expect it to be necessary for either or both of (37a) and (37b) to hold depending on $p$. In fact, either will suffice because (37a) holds if and only if (37b) holds. Because $\Delta_{AB}$ is positive definite, by analytic continuation (37b) extends from $t > 0$ to the entire complex plane, except points $-t$ on the negative real axis for which $t \in \mathrm{spectrum}(\Delta_{AB})$. Therefore, by using the Cauchy integral formula, one finds that for any function $G$ analytic on $\mathbf{C} \setminus (-\infty, 0]$

$$ G(\Delta_{A_j B_j})(K) = G(\Delta_{AB})(K). $$

Theorem 8 For fixed $K$, and $A = \sum_j A_j$, $B = \sum_j B_j$, the following are equivalent

a) $J_p(K, A, B) = \sum_j J_p(K, A_j, B_j)$ for all $p \in (0, 2)$.

b) $J_p(K, A, B) = \sum_j J_p(K, A_j, B_j)$ for some $p \in (0, 2)$.

c) $(\Delta_{A_j B_j} + t I)^{-1}(K) = (\Delta_{AB} + t I)^{-1}(K)$ for all $j$ and for all $t > 0$.

d) $A_j^{it} K B_j^{-it} = A^{it} K B^{-it}$ for all $j$ and for all $t > 0$.

e) $(\log A - \log A_j) K = K (\log B - \log B_j)$ for all $j$.

Proof: Clearly (a) $\Rightarrow$ (b). The implications (b) $\Rightarrow$ (c) $\Rightarrow$ (d), as well as (b) $\Rightarrow$ (a), follow from the discussion above. Differentiation of (d) at $t = 0$ gives (d) $\Rightarrow$ (e), and it is straightforward to verify that (e) $\Rightarrow$ (b) with $p = 1$. Moreover, (d) implies $\sum_j \mathrm{Tr}\, K^* A_j^{it} K B_j^{1-it} = \mathrm{Tr}\, K^* A^{it} K B^{1-it}$ for all $t$, which implies (a) by analytic continuation. QED



3.2 Sufficient subalgebras 

When $K = I$, we can obtain a more useful reformulation of the equality conditions by using results about sufficient subalgebras obtained in [TU [15] [33]. Since the definition and convexity properties of $J_p(I,A,B)$ extend by continuity to positive semidefinite matrices with $\ker B \subseteq \ker A$, we will formulate the conditions in this more general situation, using the conventions in Section 1.2.

Let $N \subseteq M_d$ be a subalgebra. Then there is a trace preserving conditional expectation $E_N$ from $M_d$ onto $N$, such that $\mathrm{Tr}\, AX = \mathrm{Tr}\, E_N(A)X$ for all $X \in N$. In particular, if $N = M_{d_1} \otimes I \subseteq M_{d_1} \otimes M_{d_2}$, then we have $E_N(A_{12}) = \mathrm{Tr}_2 A_{12} \otimes \tfrac{1}{d_2} I$.

Let $Q_1, \ldots, Q_m \in M_d^+$ and assume that $\ker Q_m \subseteq \ker Q_j$ for all $j$. The subalgebra $N$ is said to be sufficient for $\{Q_1, \ldots, Q_m\}$ if there is a completely positive trace preserving map $T : N \to M_d$, such that $T(E_N(Q_j)) = Q_j$ for all $j = 1, \ldots, m$. This definition is due to Petz [33] [32] and it is a quantum generalization of the well known notion of sufficiency from classical statistics. In [33], it was shown that sufficient subalgebras can be characterized by the condition

$$H(Q_j, Q_m) = H(E_N(Q_j), E_N(Q_m)) \quad \text{for all } j.$$

We combine this with the results of the previous section to obtain other useful characterizations of sufficiency.

Theorem 9 Let $Q_1, \ldots, Q_m \in M_d^+$ be such that $\ker Q_m \subseteq \ker Q_j$ for all $j$. Let $N \subseteq M_d$ be a subalgebra. The following are equivalent.

(i) $N$ is sufficient for $\{Q_1, \ldots, Q_m\}$.

(ii) $E_N(Q_j)^{it} E_N(Q_m)^{-it} P_{(\ker Q_m)^\perp} = Q_j^{it} Q_m^{-it}$ for all $j$ and $t \in \mathbb{R}$.

(iii) There exist $Q_{j,0} \in N^+$ and $D \in M_d^+$, such that $\ker D = \ker Q_m$ and $Q_j = Q_{j,0} D$ for $j = 1, \ldots, m$.

(iv) $J_p(I, Q_j, Q_m) = J_p(I, E_N(Q_j), E_N(Q_m))$ for all $j$ and some $p \in (0,1)$.

The proof of the conditions (i)-(iii) can be found in [H], see also [28]. The condition (iv) was proved in [15].

3.3 Equality conditions with K = I 

Theorem 10 Let $A_1, \ldots, A_m$ and $B_1, \ldots, B_m$ be positive semi-definite matrices with $\ker B_j \subseteq \ker A_j$, and let $A = \sum_j A_j$, $B = \sum_j B_j$. Then the following are equivalent.

a) $J_p(I,A,B) = \sum_j J_p(I,A_j,B_j)$ for all $p \in (0,2)$.

b) $J_p(I,A,B) = \sum_j J_p(I,A_j,B_j)$ for some $p \in (0,2)$.

c) $A_j^{it} B_j^{-it} = A^{it} B^{-it} P_{(\ker B_j)^\perp}$ for all $j$ and $t \in \mathbb{R}$.

d) There are positive matrices $D_1, \ldots, D_m$, with $\ker D_j = \ker B_j$, such that $[A_j, D_j] = [B_j, D_j] = 0$, and with $D = \sum_j D_j$

$$A_j = A D^{-1} D_j, \qquad B_j = B D^{-1} D_j \tag{38}$$

Proof: As in Section 3.1, (b) implies (36) on $(\ker B_j)^\perp$, with $X_j = B_j$, $X = B$. This gives

$$(\Delta_{A_jB_j} + tI)^{-1}(I) = (\Delta_{AB} + tI)^{-1}(I) \quad \text{on } (\ker B_j)^\perp. \tag{39}$$

Then (c) follows from the Cauchy integral formula as in Section 3.1.

To show (c) implies (d), we will use Theorem 9. First let $N = I \otimes M_d \subseteq M_m \otimes M_d$ and let $A_{12}$, $B_{12}$ be the block-diagonal matrices in $M_m \otimes M_d$ defined by (32). Clearly, we have $\ker A_{12} \supseteq \ker B_{12} = \sum_j |e_j\rangle\langle e_j| \otimes \ker B_j$ and $E_N(A_{12}) = \tfrac{1}{m} I \otimes A$, $E_N(B_{12}) = \tfrac{1}{m} I \otimes B$. Then (c) implies $E_N(A_{12})^{it} E_N(B_{12})^{-it} P_{(\ker B_{12})^\perp} = A_{12}^{it} B_{12}^{-it}$ for all $t$. Then by using Theorem 9 with $Q_1 = A_{12}$, $Q_m = Q_2 = B_{12}$, we can conclude that there are positive matrices $A_0, B_0 \in M_d$ and $D_{12} \in (M_m \otimes M_d)^+$, such that $\ker D_{12} = \ker B_{12}$, $[I \otimes A_0, D_{12}] = [I \otimes B_0, D_{12}] = 0$ and

$$A_{12} = (I \otimes A_0) D_{12}, \qquad B_{12} = (I \otimes B_0) D_{12} \tag{40}$$

Since $A_{12}$, $B_{12}$ are block diagonal, $D_{12} = \sum_j |e_j\rangle\langle e_j| \otimes D_j$ must also be block diagonal with $D_j \in M_d^+$, $\ker D_j = \ker B_j$, $[A_0, D_j] = [B_0, D_j] = 0$ for all $j$ and

$$A_j = A_0 D_j, \qquad B_j = B_0 D_j. \tag{41}$$

Taking $\mathrm{Tr}_1$ in (40) gives $A = A_0 D$ and $B = B_0 D$. Using this in (41) gives (38), which proves (d). The implications (d) $\Rightarrow$ (a) $\Rightarrow$ (b) are straightforward. QED
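The structure in condition (d) can be checked numerically on a small example. The sketch below is only an illustration: it assumes the closed form $J_p(I,A,B) = [\mathrm{Tr}\,A - \mathrm{Tr}\,A^pB^{1-p}]/(p(1-p))$ for $p \neq 1$, and it builds one particular family satisfying (41) by taking $A_0 = a \otimes I$, $B_0 = b \otimes I$ and $D_j = I \otimes d_j$. The helper names (`mpow`, `J`, `rand_pos`, `gap`) are hypothetical.

```python
import numpy as np

def mpow(M, s):
    """Real power of a Hermitian positive definite matrix via its eigendecomposition."""
    w, V = np.linalg.eigh(M)
    return (V * w**s) @ V.conj().T

def J(p, A, B):
    """Assumed closed form of J_p(I, A, B) for p != 1."""
    return np.trace(A - mpow(A, p) @ mpow(B, 1 - p)).real / (p * (1 - p))

def rand_pos(n, rng):
    """Random positive definite n x n matrix (illustrative test data)."""
    G = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    return G @ G.conj().T + 0.1 * np.eye(n)

def gap(p, As, Bs):
    """sum_j J_p(I, A_j, B_j) - J_p(I, sum_j A_j, sum_j B_j)."""
    return sum(J(p, A, B) for A, B in zip(As, Bs)) - J(p, sum(As), sum(Bs))

rng = np.random.default_rng(0)
a, b = rand_pos(2, rng), rand_pos(2, rng)      # non-commuting factors of A_0, B_0
ds = [rand_pos(2, rng) for _ in range(3)]      # factors of D_j

# A_j = A_0 D_j and B_j = B_0 D_j with A_0 = a (x) I, B_0 = b (x) I, D_j = I (x) d_j
As = [np.kron(a, d) for d in ds]
Bs = [np.kron(b, d) for d in ds]

structured = [gap(p, As, Bs) for p in (0.3, 0.5, 1.5)]   # expected to vanish
generic = gap(0.5, [rand_pos(4, rng) for _ in range(3)],
                   [rand_pos(4, rng) for _ in range(3)])  # expected to be nonnegative
print(structured, generic)
```

For the structured family the gap vanishes up to rounding for each tested $p$, in line with the $p$-independence of the equality conditions; for unstructured data the gap is nonnegative by subadditivity.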

We return briefly to the case of arbitrary $K$. Note that if the condition (d) holds and $[D_j, K] = 0$ for all $j$, then $J_p(K,A,B) = \sum_j J_p(K,A_j,B_j)$ for all $p \in (0,2)$; this gives a sufficient, but not necessary, condition for equality if $K \neq I$. The next result reduces the case of $K$ unitary to $K = I$. Then, we can apply the conditions of Theorem 10 to $A_j$ and $KB_jK^*$.

Theorem 11 If $K$ is unitary, then $J_p(K,A,B) = \sum_j J_p(K,A_j,B_j)$ if and only if $J_p(I,A,KBK^*) = \sum_j J_p(I,A_j,KB_jK^*)$.

Proof: When $K$ is unitary, then $KB^pK^* = (KBK^*)^p$ which implies $J_p(K,A,B) = J_p(I,A,KBK^*)$. QED
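The identity used in this proof is easy to confirm numerically. The following sketch assumes the closed form $J_p(K,A,B) = [\mathrm{Tr}\,K^*AK - \mathrm{Tr}\,K^*A^pKB^{1-p}]/(p(1-p))$ for $p \neq 1$; the helper names and the random test data are illustrative only.

```python
import numpy as np

def mpow(M, s):
    """Real power of a Hermitian positive definite matrix via its eigendecomposition."""
    w, V = np.linalg.eigh(M)
    return (V * w**s) @ V.conj().T

def J(p, K, A, B):
    """Assumed closed form of J_p(K, A, B) for p != 1."""
    Kh = K.conj().T
    val = np.trace(Kh @ A @ K - Kh @ mpow(A, p) @ K @ mpow(B, 1 - p))
    return val.real / (p * (1 - p))

def rand_pos(n, rng):
    """Random positive definite n x n matrix (illustrative test data)."""
    G = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    return G @ G.conj().T + 0.1 * np.eye(n)

rng = np.random.default_rng(1)
n = 3
A, B = rand_pos(n, rng), rand_pos(n, rng)

# A random unitary K from the QR decomposition of a complex Gaussian matrix
K, _ = np.linalg.qr(rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n)))
I = np.eye(n)

# Compare J_p(K, A, B) with J_p(I, A, K B K*) for one p below 1 and one above
pairs = [(J(p, K, A, B), J(p, I, A, K @ B @ K.conj().T)) for p in (0.4, 1.6)]
print(pairs)
```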

One can try to extend the results of this section to the case $\|K\| \le 1$, and hence to all $K$, by using a unitary dilation $\widehat{U}$ of $K$ constructed from $K$ and $L$, where $L = U(I - |K|^2)^{1/2}$ and $K = U|K|$ is the polar decomposition. Then, with

$$\widehat{A} = \begin{pmatrix} A & 0 \\ 0 & 0 \end{pmatrix}, \qquad \widehat{B} = \begin{pmatrix} B & 0 \\ 0 & 0 \end{pmatrix}$$

we have $J_p(K,A,B) = J_p(\widehat{U}, \widehat{A}, \widehat{B})$, so that we may use Theorem 11 to get conditions for equality. But note that the conditions of Theorem 10 require that $\ker \widehat{U}\widehat{B}_j\widehat{U}^* \subseteq \ker \widehat{A}_j$, and it can be shown that this implies $P_{(\ker A_j)^\perp} K P_{(\ker B_j)^\perp} K^* = P_{(\ker A_j)^\perp}$, where $P_{\mathcal{M}}$ denotes a projection onto the subscripted space. In particular, if all $A_j$ and $B_j$ are invertible, this restricts us to unitary $K$.

3.4 Equality in monotonicity under partial trace

It is easy to see that when $A_{12} = A_1 \otimes A_2$ and $B_{12} = B_1 \otimes B_2$, then $J_p(I, A_{12}, B_{12}) = J_p(I, A_2, B_2)$ if and only if $A_1 = B_1$ with $\mathrm{Tr}\, A_1 = 1$. However, it is not necessary that $A_{12} = A_1 \otimes A_2$. The equality conditions are given by the following theorem.
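The product case can be illustrated numerically. The sketch below assumes the closed form $J_p(I,A,B) = [\mathrm{Tr}\,A - \mathrm{Tr}\,A^pB^{1-p}]/(p(1-p))$ for $p \neq 1$ and normalizes $\mathrm{Tr}\,A_1 = \mathrm{Tr}\,B_1 = 1$, so that the partial traces over the first factor are exactly $A_2$ and $B_2$. All names and test matrices are hypothetical.

```python
import numpy as np

def mpow(M, s):
    """Real power of a Hermitian positive definite matrix via its eigendecomposition."""
    w, V = np.linalg.eigh(M)
    return (V * w**s) @ V.conj().T

def J(p, A, B):
    """Assumed closed form of J_p(I, A, B) for p != 1."""
    return np.trace(A - mpow(A, p) @ mpow(B, 1 - p)).real / (p * (1 - p))

def rand_pos(n, rng):
    """Random positive definite n x n matrix (illustrative test data)."""
    G = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    return G @ G.conj().T + 0.1 * np.eye(n)

def rand_density(n, rng):
    """Random positive definite matrix normalized to unit trace."""
    R = rand_pos(n, rng)
    return R / np.trace(R).real

rng = np.random.default_rng(2)
A1, B1 = rand_density(2, rng), rand_density(2, rng)
A2, B2 = rand_pos(3, rng), rand_pos(3, rng)

ps = (0.5, 1.5)
# Case A_1 = B_1 with Tr A_1 = 1: the partial trace loses nothing
equal = [J(p, np.kron(A1, A2), np.kron(A1, B2)) - J(p, A2, B2) for p in ps]
# Case A_1 != B_1: monotonicity under the partial trace is strict
strict = [J(p, np.kron(A1, A2), np.kron(B1, B2)) - J(p, A2, B2) for p in ps]
print(equal, strict)
```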

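Theorem 12 Let $K_{12} = I_{12}$ and $A_{12}, B_{12} \in B(\mathcal{H}_1 \otimes \mathcal{H}_2)^+$, with $\ker B_{12} \subseteq \ker A_{12}$. Equality holds in (28) if and only if

(i) $\mathcal{H}_2 = \bigoplus_n \mathcal{H}_n^L \otimes \mathcal{H}_n^R$,

(ii) $A_{12} = \bigoplus_n A_n^L \otimes A_n^R$ with $A_n^L \in B(\mathcal{H}_1 \otimes \mathcal{H}_n^L)^+$ and $A_n^R \in B(\mathcal{H}_n^R)^+$,

(iii) $B_{12} = \bigoplus_n B_n^L \otimes B_n^R$ with $B_n^L \in B(\mathcal{H}_1 \otimes \mathcal{H}_n^L)^+$ and $B_n^R \in B(\mathcal{H}_n^R)^+$,

(iv) $A_n^L = B_n^L$ for all $n$.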

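Proof: Let us denote $A_j = \tfrac{1}{d_1} W_j A_{12} W_j^*$, $B_j = \tfrac{1}{d_1} W_j B_{12} W_j^*$, with $W_j$ defined as in the proof of Theorem 7. Then we get that equality in (28) is equivalent to

$$J_p\big(I_{12}, \textstyle\sum_j A_j, \sum_j B_j\big) = \sum_j J_p(I_{12}, A_j, B_j).$$

By Theorem 10, equality for some $p$ implies equality for all $p$, so that $J_p(I_{12}, A_{12}, B_{12}) = J_p(I_2, \mathrm{Tr}_1 A_{12}, \mathrm{Tr}_1 B_{12}) = J_p(I_{12}, E_N(A_{12}), E_N(B_{12}))$ for $p \in (0,1)$, where $N$ is the subalgebra $I_1 \otimes B(\mathcal{H}_2) \subseteq B(\mathcal{H}_1 \otimes \mathcal{H}_2)$. Hence $N$ is sufficient for $\{A_{12}, B_{12}\}$ and, by Theorem 9, there are some $A_R, B_R \in B(\mathcal{H}_2)^+$ and $D \in B(\mathcal{H}_1 \otimes \mathcal{H}_2)^+$, $\ker D = \ker B_{12}$, such that $[(I_1 \otimes A_R), D] = [(I_1 \otimes B_R), D] = 0$ and

$$A_{12} = D(I_1 \otimes A_R), \qquad B_{12} = D(I_1 \otimes B_R) \tag{42}$$

Now let $M_1$ be the subalgebra in $B(\mathcal{H}_2)$ generated by $A_R$, $B_R$. Then $D \in (I_1 \otimes M_1)' = B(\mathcal{H}_1) \otimes M_1'$, where $M'$ denotes the commutant of $M$. There is a decomposition $\mathcal{H}_2 = \bigoplus_n \mathcal{H}_n^L \otimes \mathcal{H}_n^R$ such that

$$M_1' = \bigoplus_n B(\mathcal{H}_n^L) \otimes I_n^R, \qquad M_1 = \bigoplus_n I_n^L \otimes B(\mathcal{H}_n^R)$$

and $D = \bigoplus_n D_n \otimes I_n^R$ where $D_n \in B(\mathcal{H}_1 \otimes \mathcal{H}_n^L)^+$. Since $A_R, B_R \in M_1$, we get the result, with $A_n^L = B_n^L = D_n$. The converse can be verified directly. QED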

Applying this result in the case $A_{12} \mapsto A_{123}$ and $B_{12} \mapsto A_{12} \otimes I_3$ gives equality conditions in (34). Since these are independent of $p$, they are identical to the conditions, first given in [15], for equality in SSA (35), which corresponds to $p = 1$.

Corollary 13 Equality holds in (34) if and only if

(i) $\mathcal{H}_2 = \bigoplus_n \mathcal{H}_n^L \otimes \mathcal{H}_n^R$.

(ii) $A_{123} = \bigoplus_n A_n^L \otimes A_n^R$ with $A_n^L \in B(\mathcal{H}_1 \otimes \mathcal{H}_n^L)^+$ and $A_n^R \in B(\mathcal{H}_n^R \otimes \mathcal{H}_3)^+$.

Proof: It suffices to let $A_{12} \to A_{123}$ and $B_{12} \to A_{12} \otimes I_3$ in Theorem 12. QED

To apply these results in Section 4, it is useful to observe that condition (ii) in Corollary 13 above can be written as

$$A_{123} = (F_L \otimes I_3)(I_1 \otimes F_R) \tag{43}$$

with $F_L \in B(\mathcal{H}_1 \otimes \mathcal{H}_2)^+$, $F_R \in B(\mathcal{H}_2 \otimes \mathcal{H}_3)^+$, $[F_L \otimes I_3, I_1 \otimes F_R] = 0$. Combining this with part (d) of Theorem 10 gives the following useful result, which essentially allows us to bypass the need to apply Theorem 10 to $J_p(I, \mathcal{A}_j, \mathcal{W}_n \mathcal{A}_j \mathcal{W}_n)$.

Corollary 14 Let $A_j \in M_{d_1} \otimes M_{d_2}$, $A = \sum_j A_j$. Then

$$J_p(I_{12}, A, (\mathrm{Tr}_2 A) \otimes I_2) = \sum_j J_p(I_{12}, A_j, (\mathrm{Tr}_2 A_j) \otimes I_2) \tag{44}$$

if and only if there are $D_j \in M_{d_1}^+$, such that $\ker D_j = \ker \mathrm{Tr}_2 A_j$, $[A_j, D_j \otimes I] = 0$ and $A_j = A(D^{-1}D_j \otimes I)$ with $D = \sum_j D_j$.

Proof: Let $A_{123} = \sum_j |e_j\rangle\langle e_j| \otimes A_j \in M_m \otimes M_{d_1} \otimes M_{d_2}$, then $A = A_{23} \in M_{d_1} \otimes M_{d_2}$ and (44) can be written as

$$J_p(I_{23}, A_{23}, A_2 \otimes I_3) = J_p(I_{123}, A_{123}, A_{12} \otimes I_3).$$

By (43), this is equivalent to the existence of $F_L$ and $F_R$, $[(F_L \otimes I_3), (I_1 \otimes F_R)] = 0$, such that $A_{123} = (F_L \otimes I_3)(I_1 \otimes F_R)$. Since $A_{(1)(23)}$ is block-diagonal, $F_L$ must be of the form $F_L = \sum_j |e_j\rangle\langle e_j| \otimes D_j$, so that $A_j = F_R(D_j \otimes I)$. Then $\mathrm{Tr}_2 A_j = D_j \mathrm{Tr}_2 F_R$, which implies that $\ker D_j \subseteq \ker \mathrm{Tr}_2 A_j$. If we let $P_j = P_{(\ker \mathrm{Tr}_2 A_j)^\perp}$, then $P_j$ commutes with $D_j$ and

$$A_j = (P_j \otimes I)A_j = (P_jD_j \otimes I)F_R,$$

so that we can assume that $\ker D_j = \ker \mathrm{Tr}_2 A_j$, by taking $P_jD_j$ instead of $D_j$. Taking $\mathrm{Tr}_1$ of (43) gives $A = (D \otimes I_3)F_R = F_R(D \otimes I_3)$ so that $A_j = A(D^{-1}D_j \otimes I)$. QED

4 Equality in joint convexity of Carlen-Lieb 

Carlen and Lieb [8] obtained several convexity inequalities from those of the map

$$\Upsilon_{p,q}(K,A) = \mathrm{Tr}\,(K^*A^pK)^{q/p} \tag{45}$$

using an identity which we write only for $q = 1$ and $p > 1$ in our notation as

$$\Upsilon_{p,1}(K,A) = (p-1)\inf\Big\{ J_p(K,A,X) + \tfrac{1}{p}\mathrm{Tr}\,X + \tfrac{1}{p(p-1)}\mathrm{Tr}\,K^*AK : X > 0 \Big\} \tag{46}$$

We introduce the closely related quantity

$$\widehat{\Upsilon}_{p,1}(K,A) = \inf\Big\{ J_p(K,A,X) + \tfrac{1}{p}\mathrm{Tr}\,X : X > 0 \Big\} \tag{47}$$
$$\phantom{\widehat{\Upsilon}_{p,1}(K,A)} = \tfrac{1}{p-1}\Big( \Upsilon_{p,1}(K,A) - \tfrac{1}{p}\mathrm{Tr}\,K^*AK \Big) \tag{48}$$

which is well-defined for all $p \in (0,2)$ and allows us to continue to treat the cases $p < 1$ and $p > 1$ simultaneously, as well as include the special case $p = 1$ for which

$$\widehat{\Upsilon}_{1,1}(K,A) = -\mathrm{Tr}\,K^*AK \log(K^*AK) + \mathrm{Tr}\,K^*(A\log A)K + \mathrm{Tr}\,K^*AK$$
$$\phantom{\widehat{\Upsilon}_{1,1}(K,A)} = S(K^*AK) + \mathrm{Tr}\,KK^*A\log A + \mathrm{Tr}\,K^*AK \tag{49}$$

Since we are dealing with finite dimensional spaces, the infimum in (46) has a minimizer which satisfies

$$X_{\min} = (K^*A^pK)^{1/p} \tag{50}$$

For fixed $K$, let $X_j$ denote the minimizer associated with $A_j$. Then

$$\widehat{\Upsilon}_{p,1}(K,A_1) + \widehat{\Upsilon}_{p,1}(K,A_2) = J_p(K,A_1,X_1) + \tfrac{1}{p}\mathrm{Tr}\,X_1 + J_p(K,A_2,X_2) + \tfrac{1}{p}\mathrm{Tr}\,X_2$$
$$\ge J_p(K, A_1 + A_2, X_1 + X_2) + \tfrac{1}{p}\mathrm{Tr}\,(X_1 + X_2) \tag{51}$$
$$\ge \inf\Big\{ J_p(K, A_1 + A_2, X) + \tfrac{1}{p}\mathrm{Tr}\,X : X > 0 \Big\}$$
$$= \widehat{\Upsilon}_{p,1}(K, A_1 + A_2) \tag{52}$$

which proves convexity of $\widehat{\Upsilon}_{p,1}$. Note that equality above requires both $X = \sum_j X_j$ and $J_p(K,A,X) = \sum_j J_p(K,A_j,X_j)$, where $X$ is the minimizer associated with $A$.
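The minimizer (50), the closed form (48) and the subadditivity in (51)-(52) can be probed numerically. The sketch below assumes the closed form $J_p(K,A,X) = [\mathrm{Tr}\,K^*AK - \mathrm{Tr}\,K^*A^pKX^{1-p}]/(p(1-p))$ for $p \neq 1$ and an invertible $K$ (so that $K^*A^pK$ is positive definite). The function names and random inputs are hypothetical.

```python
import numpy as np

def mpow(M, s):
    """Real power of a Hermitian positive definite matrix via its eigendecomposition."""
    w, V = np.linalg.eigh(M)
    return (V * w**s) @ V.conj().T

def J(p, K, A, X):
    """Assumed closed form of J_p(K, A, X) for p != 1."""
    Kh = K.conj().T
    val = np.trace(Kh @ A @ K - Kh @ mpow(A, p) @ K @ mpow(X, 1 - p))
    return val.real / (p * (1 - p))

def objective(p, K, A, X):
    """The function of X minimized in (47)."""
    return J(p, K, A, X) + np.trace(X).real / p

def upsilon_hat(p, K, A):
    """Closed form (48): [Tr (K* A^p K)^(1/p) - (1/p) Tr K* A K] / (p - 1)."""
    Kh = K.conj().T
    ups = np.trace(mpow(Kh @ mpow(A, p) @ K, 1 / p)).real
    return (ups - np.trace(Kh @ A @ K).real / p) / (p - 1)

def rand_pos(n, rng):
    G = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    return G @ G.conj().T + 0.1 * np.eye(n)

def check(p, rng, n=3):
    K = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))  # generically invertible
    A1, A2 = rand_pos(n, rng), rand_pos(n, rng)
    X_min = mpow(K.conj().T @ mpow(A1, p) @ K, 1 / p)           # candidate minimizer (50)
    at_min = objective(p, K, A1, X_min)
    # Small Hermitian perturbations that keep X positive definite
    lam = np.linalg.eigvalsh(X_min).min()
    perturbed = []
    for _ in range(5):
        G = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
        H = (G + G.conj().T) / 2
        H /= np.linalg.norm(H, 2)
        perturbed.append(objective(p, K, A1, X_min + 0.1 * lam * H))
    subadd = upsilon_hat(p, K, A1) + upsilon_hat(p, K, A2) - upsilon_hat(p, K, A1 + A2)
    return at_min, upsilon_hat(p, K, A1), min(perturbed), subadd

rng = np.random.default_rng(3)
results = [check(p, rng) for p in (0.5, 1.5)]
print(results)
```

Each tuple contains the objective at $X_{\min}$, the closed form (48), the smallest perturbed objective value, and the subadditivity gap, which should be nonnegative.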

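Now we introduce some notation following the strategy in the published version of [8]. Let $|\mathbf{1}\rangle$ denote the vector $(1, 1, \ldots, 1)$ with all components 1 and $|e_1\rangle$ the vector $(1, 0, \ldots, 0)$. Define

$$\mathcal{K} = I \otimes |\mathbf{1}\rangle\langle e_1| = \begin{pmatrix} I & 0 & \cdots & 0 \\ I & 0 & \cdots & 0 \\ \vdots & \vdots & & \vdots \\ I & 0 & \cdots & 0 \end{pmatrix} \tag{53}$$

and

$$\mathcal{A}_j = \sum_k A_{jk} \otimes |e_k\rangle\langle e_k| = \bigoplus_k A_{jk} \tag{54}$$

and $\mathcal{A} = \sum_j \mathcal{A}_j = \sum_k A_k \otimes |e_k\rangle\langle e_k| = \bigoplus_k A_k$ with $A_k = \sum_j A_{jk}$. Then

$$\mathcal{K}^* \mathcal{A}^p \mathcal{K} = \Big( \sum_k A_k^p \Big) \otimes |e_1\rangle\langle e_1|.$$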
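The block identity above can be verified directly on small matrices. This sketch follows the convention that the second tensor factor labels the blocks, so `np.kron(A_k, E_kk)` plays the role of $A_k \otimes |e_k\rangle\langle e_k|$; it also checks that $\mathrm{Tr}\,(\mathcal{K}^*\mathcal{A}^p\mathcal{K})^{1/p} = \mathrm{Tr}\,(\sum_k A_k^p)^{1/p}$. The dimensions, the value of $p$ and the helper names are arbitrary illustrative choices.

```python
import numpy as np

def mpow_psd(M, s):
    """Positive power of a Hermitian positive semidefinite matrix.

    Eigenvalues are clipped at zero so that rounding noise on the
    kernel does not produce NaNs.
    """
    w, V = np.linalg.eigh(M)
    w = np.clip(w, 0.0, None)
    return (V * w**s) @ V.conj().T

def rand_pos(n, rng):
    G = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    return G @ G.conj().T + 0.1 * np.eye(n)

rng = np.random.default_rng(4)
d, m, p = 2, 3, 1.5
blocks = [rand_pos(d, rng) for _ in range(m)]          # the blocks A_k

basis = np.eye(m)
E = [np.outer(basis[k], basis[k]) for k in range(m)]   # |e_k><e_k|

calA = sum(np.kron(blocks[k], E[k]) for k in range(m))         # A = sum_k A_k (x) |e_k><e_k|
calK = np.kron(np.eye(d), np.outer(np.ones(m), basis[0]))      # K = I (x) |1><e_1|

lhs = calK.conj().T @ mpow_psd(calA, p) @ calK
rhs = np.kron(sum(mpow_psd(Ak, p) for Ak in blocks), E[0])

upsilon = np.trace(mpow_psd(lhs, 1 / p)).real                  # Tr (K* A^p K)^(1/p)
direct = np.trace(mpow_psd(sum(mpow_psd(Ak, p) for Ak in blocks), 1 / p)).real
print(np.allclose(lhs, rhs), upsilon, direct)
```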

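With this notation, we make some definitions following Carlen and Lieb but modified to allow a unified treatment of $p \in (0,2)$.

$$\Phi_{(p,1)}(\mathcal{A}) = \Phi_{(p,1)}(A_1, A_2, A_3, \ldots) \equiv \Upsilon_{p,1}(\mathcal{K}, \mathcal{A}) = \mathrm{Tr}\,(A_1^p + A_2^p + A_3^p + \ldots)^{1/p} \tag{55}$$
$$\widehat{\Phi}_{(p,1)}(\mathcal{A}) = \widehat{\Phi}_{(p,1)}(A_1, A_2, A_3, \ldots) \equiv \widehat{\Upsilon}_{p,1}(\mathcal{K}, \mathcal{A}) = \tfrac{1}{p-1}\Big[ \Phi_{(p,1)}(\mathcal{A}) - \tfrac{1}{p}\mathrm{Tr}\,\mathcal{A} \Big] \tag{56}$$

The definitions of $\Phi$ and $\widehat{\Phi}$ apply only when $\mathcal{A}$ is a block diagonal matrix in $M_{d_1} \otimes M_{d_2}$. We now extend this to arbitrary matrices $A_{12} \in M_{d_1} \otimes M_{d_2}$.

$$\Psi_{(p,1)}(A_{12}) \equiv \mathrm{Tr}_1 (\mathrm{Tr}_2 A_{12}^p)^{1/p} \tag{57}$$
$$\widehat{\Psi}_{(p,1)}(A_{12}) \equiv \tfrac{1}{p-1}\Big[ \Psi_{(p,1)}(A_{12}) - \tfrac{1}{p}\mathrm{Tr}\,A_{12} \Big] \tag{58}$$

For $p = 1$, the formulas with hats are related to the conditional entropy, from which they differ by a constant

$$\widehat{\Phi}_{(1,1)}(A_1, A_2, A_3, \ldots) - \mathrm{Tr}\,\mathcal{A} = -\mathrm{Tr}\,\big(\textstyle\sum_k A_k\big)\log\big(\sum_k A_k\big) + \sum_k \mathrm{Tr}\,A_k \log A_k$$
$$= S\big(\textstyle\sum_k A_k\big) - S(\mathcal{A}) = J_1(I, \mathcal{A}, \mathrm{Tr}_2 \mathcal{A} \otimes I_2) \tag{59}$$
$$\widehat{\Psi}_{(1,1)}(A_{12}) - \mathrm{Tr}\,A_{12} = S(A_1) - S(A_{12}) = H(A_{12}, A_1 \otimes I_2) \tag{60}$$

When $A_{12}$ is block diagonal, $\widehat{\Psi}_{(p,1)}(A_{12}) = \widehat{\Phi}_{(p,1)}(A_{12})$ with the understanding that $\mathrm{Tr}_2 A_{12} = \sum_k A_k$. Now let $W_n$ denote the generalized Pauli matrices as in Section 2.3, $\mathcal{W}_n = I_1 \otimes W_n$, and define

$$A_{123} = \sum_n \mathcal{W}_n A_{12} \mathcal{W}_n^* \otimes |e_n\rangle\langle e_n| = \bigoplus_n \mathcal{W}_n A_{12} \mathcal{W}_n^* \tag{61}$$

so that $A_{123}$ is block diagonal with blocks $\mathcal{W}_n A_{12} \mathcal{W}_n^*$. Then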

<V *( P ,d(a 2 ) = $(Ai2)(3)) = HmAnWi, mA 12 w;, ...)■ (62) 

It is straightforward to show that for $p \in (0,2)$ the functions $\widehat{\Phi}_{(p,1)}(\mathcal{A})$ and $\widehat{\Psi}_{(p,1)}(A)$ are all convex in $A$, inheriting this property from the quantities from which they are defined. In view of (59) and (60), the conditions for equality in the next two theorems are not surprising.

Theorem 15 The function $\widehat{\Phi}_{(p,1)}(\mathcal{A})$ is convex in $\mathcal{A}$ for $p \in (0,2)$. Moreover, the following are equivalent:

(i) $J_p(I, \mathcal{A}, (\mathrm{Tr}_2 \mathcal{A}) \otimes I_2) = \sum_j J_p(I, \mathcal{A}_j, (\mathrm{Tr}_2 \mathcal{A}_j) \otimes I_2)$,

(ii) There are matrices $D_j \ge 0$, $D = \sum_j D_j$, such that $[A_{jk}, D_j] = 0$, $\ker D_j = \ker(\sum_k A_{jk})$ and $A_{jk} = A_k D^{-1} D_j$,

(iii) $\widehat{\Phi}_{(p,1)}(A_1, A_2, A_3, \ldots) = \sum_j \widehat{\Phi}_{(p,1)}(A_{j1}, A_{j2}, A_{j3}, \ldots)$.

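Proof: It follows from Corollary 14 and the fact that $\mathcal{A}_j$ are block-diagonal that (i) $\Leftrightarrow$ (ii), and it is straightforward to verify that (ii) $\Rightarrow$ (iii). Moreover, (iii) implies (i) for $p = 1$, by (59). To show that (iii) implies (ii) for $p \neq 1$, observe that (iii) implies $\widehat{\Upsilon}_{p,1}(\mathcal{K}, \mathcal{A}) = \sum_j \widehat{\Upsilon}_{p,1}(\mathcal{K}, \mathcal{A}_j)$, and this implies

$$J_p(\mathcal{K}, \mathcal{A}, \mathcal{X}) = \sum_j J_p(\mathcal{K}, \mathcal{A}_j, \mathcal{X}_j) \tag{63}$$

where $\mathcal{X}_j = (\mathcal{K}^*\mathcal{A}_j^p\mathcal{K})^{1/p} = X_j \otimes |e_1\rangle\langle e_1|$ and $\sum_j \mathcal{X}_j = \mathcal{X} = (\mathcal{K}^*\mathcal{A}^p\mathcal{K})^{1/p} = X \otimes |e_1\rangle\langle e_1|$, with $X_j = (\sum_k A_{jk}^p)^{1/p}$ and $X = (\sum_k A_k^p)^{1/p}$. Since

$$\mathcal{K}^*\mathcal{A}_j^p\mathcal{K}\,\mathcal{X}_j^{1-p} = \sum_k A_{jk}^p X_j^{1-p} \otimes |e_1\rangle\langle e_1|,$$

with a similar expression for $\mathcal{K}^*\mathcal{A}^p\mathcal{K}\,\mathcal{X}^{1-p}$, we find

$$\sum_k J_p(I, A_k, X) = J_p(\mathcal{K}, \mathcal{A}, \mathcal{X}) = \sum_j J_p(\mathcal{K}, \mathcal{A}_j, \mathcal{X}_j) = \sum_{k,j} J_p(I, A_{jk}, X_j).$$

Convexity then implies that we must have

$$J_p(I, A_k, X) = \sum_j J_p(I, A_{jk}, X_j) \quad \forall k. \tag{64}$$

Since $\ker X_j \subseteq \ker A_{jk}$, Theorem 10 implies that

$$A_k^{it} X^{-it} P_{(\ker X_j)^\perp} = A_{jk}^{it} X_j^{-it} \quad \text{for all } k, j, t. \tag{65}$$

After writing $\widetilde{A}_k = \sum_j |e_j\rangle\langle e_j| \otimes A_{jk}$, $\widetilde{X} = \sum_j |e_j\rangle\langle e_j| \otimes X_j$, this reads

$$\widetilde{A}_k^{it}\widetilde{X}^{-it} = \big(\tfrac{1}{m} I \otimes \mathrm{Tr}_1 \widetilde{A}_k\big)^{it}\big(\tfrac{1}{m} I \otimes \mathrm{Tr}_1 \widetilde{X}\big)^{-it} P_{(\ker \widetilde{X})^\perp},$$

so that, by Theorem 9, there are elements $A_{k,0} \in M_d^+$ and $\widetilde{D} \in (M_m \otimes M_d)^+$, such that $\ker \widetilde{D} = \ker \widetilde{X}$, $[(I \otimes A_{k,0}), \widetilde{D}] = 0$ and $\widetilde{A}_k = (I \otimes A_{k,0})\widetilde{D}$. As before, one finds $\widetilde{D} = \sum_j |e_j\rangle\langle e_j| \otimes D_j$ for some $D_j \in M_d^+$, which implies (ii). QED

Theorem 16 The function $\widehat{\Psi}_{(p,1)}(A_{12})$ is convex in $A_{12}$ for $p \in (0,2)$. Moreover, if we let $A_{123}$ denote the block diagonal matrix with blocks $\mathcal{W}_n A \mathcal{W}_n^*$, the following are equivalent:

(i) $J_p(I, A_{123}, A_1 \otimes I_{23}) = \sum_j J_p(I, (A_{123})_j, (A_1)_j \otimes I_{23})$ with $A_{123}$ defined by (61),

(ii) There are matrices $D_j \in M_{d_1}^+$, $D = \sum_j D_j$, such that $\ker D_j = \ker (A_1)_j$, $[A_j, D_j \otimes I] = 0$ and $A_j = A(D^{-1}D_j \otimes I)$.

(iii) $\widehat{\Psi}_{(p,1)}(A) = \sum_j \widehat{\Psi}_{(p,1)}(A_j)$.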

Proof: It follows from the definition of -4i23, that d 2 p ^^(A) = $(^123) • The 
equivalence (i) $\Leftrightarrow$ (iii) follows immediately from Theorem 15, and (i) $\Leftrightarrow$ (ii) can be
shown to follow from Corollary 14. QED



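Theorem 17 The following monotonicity inequalities hold,

$$\widehat{\Psi}_{(p,1)}(A_{23}) \le \widehat{\Psi}_{(p,1)}(A_{123}), \quad p \in (0,2) \tag{66a}$$
$$\Psi_{(p,1)}(A_{23}) \ge \Psi_{(p,1)}(A_{123}), \quad p \in (0,1) \tag{66b}$$
$$\Psi_{(p,1)}(A_{23}) \le \Psi_{(p,1)}(A_{123}), \quad p \in [1,2) \tag{66c}$$

Moreover, equality holds if and only if the conditions of Corollary 13 are satisfied.

Proof: It suffices to give the proof for $\widehat{\Psi}$ since the other inequalities follow immediately. The argument is similar to that for Theorem 7. Let $W_n$ denote the generalized Pauli matrices of Section 2.3, but now let $\mathcal{W}_n = W_n \otimes I_{23}$. Then the convexity of $\widehat{\Psi}_{(p,1)}(A_{23})$ implies

$$\widehat{\Psi}_{(p,1)}(A_{23}) = \tfrac{1}{d_1}\widehat{\Psi}_{(p,1)}(I_1 \otimes A_{23})$$
$$= \tfrac{1}{d_1^2}\widehat{\Psi}_{(p,1)}\Big(\sum_n \mathcal{W}_n A_{123} \mathcal{W}_n^*\Big)$$
$$\le \tfrac{1}{d_1^2}\sum_n \widehat{\Psi}_{(p,1)}(\mathcal{W}_n A_{123} \mathcal{W}_n^*) = \widehat{\Psi}_{(p,1)}(A_{123})$$

where $d_1$ is the dimension of the first space and we used the invariance of $\widehat{\Psi}$ under unitaries of the form $U_1 \otimes I_{23}$. In the case $p = 1$, it follows from (60) that $\widehat{\Psi}_{(1,1)}(A_{23}) \le \widehat{\Psi}_{(1,1)}(A_{123})$ becomes

$$S(A_2) - S(A_{23}) \le S(A_{12}) - S(A_{123}) \tag{67}$$

which is SSA. Because the equality conditions in Theorem 16 are independent of $p$, they are identical to those for SSA, which are given in Corollary 13. QED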






The Carlen-Lieb triple Minkowski inequality for the case $q = 1$ is an immediate corollary of Theorem 17. Observe that

$$\mathrm{Tr}_3 \mathrm{Tr}_1 (\mathrm{Tr}_2 A_{123}^p)^{1/p} = \Psi_{(p,1)}(A_{(13)(2)}) \tag{68a}$$
$$\mathrm{Tr}_3 \big[\mathrm{Tr}_2 (\mathrm{Tr}_1 A_{123})^p\big]^{1/p} = \Psi_{(p,1)}(A_{32}) \tag{68b}$$

so that it follows immediately from (66c) that

$$\mathrm{Tr}_3 \big[\mathrm{Tr}_2 (\mathrm{Tr}_1 A_{123})^p\big]^{1/p} = \Psi_{(p,1)}(A_{32}) \le \Psi_{(p,1)}(A_{132}) = \mathrm{Tr}_3 \mathrm{Tr}_1 (\mathrm{Tr}_2 A_{123}^p)^{1/p} \tag{69}$$

for $1 < p < 2$ and from (66b) that the inequality reverses for $0 < p < 1$. Moreover, the conditions for equality are again independent of $p$ and identical to those for equality in SSA, given in Corollary 13.

5 Final remarks 

It should be clear that the results in Section 2 are not restricted to $J_p(K,A,B)$. The function $g_p(x)$ given in (6) can be replaced by any operator convex function of the form $g(x) = xf(x)$ with $f$ operator monotone on $(0,\infty)$. Moreover, if the measure $\mu(t)$ in (17) is supported on $(0,\infty)$, then the conditions for equality are identical to those in Section 3.

In particular, our results go through with $g_p$ replaced by $\widetilde{g}_p$ and $J_p(I,A,B)$ replaced by $\widetilde{J}_p(I,A,B)$, which is well-defined for $p \in [-1,1)$ with $\widetilde{J}_0(I,A,B) = H(B,A)$. Thus our results can be extended to all $p \in (-1,2)$. The case $p = 2$ reduces to the convexity of $(A,X) \mapsto \mathrm{Tr}\,X^*A^{-1}X$ with $A > 0$ proved in [21]. One can show that equality holds if and only if $X_j = A_jT$ for all $j$, with $T = A^{-1}X$. We recently learned that Kiefer [16] proved the $p = 2$ convexity, by a different method, much earlier and also found these equality conditions.

There have been various attempts, e.g., the Renyi [35] and Tsallis [ID] entropies, to generalize quantum entropy in a way that gives the usual von Neumann entropy at $p = 1$. In this paper we have considered two extensions of the conditional entropy involving an exponent $p \in (0,2)$, namely,

• $J_p(I, A_{12}, A_1)$, which gives $\mathrm{Tr}\,A_{23}^p A_2^{1-p} \ge \mathrm{Tr}\,A_{123}^p A_{12}^{1-p}$ for $p \in (0,1)$, with the inequality reversed for $p \in (1,2)$, and can be thought of as a pseudo-metric; and

• $\widehat{\Psi}_{(p,1)}(A_{12})$, which gives $\mathrm{Tr}_2 (\mathrm{Tr}_3 A_{23}^p)^{1/p} \ge \mathrm{Tr}_{12} (\mathrm{Tr}_3 A_{123}^p)^{1/p}$ for $p \in (0,1)$, with the inequality reversed for $p \in (1,2)$, and can be thought of as a pseudo-norm.

These expressions are quite different for $p \neq 1$, but arise from quantities with the same convexity and monotonicity properties, as well as the same equality conditions, which are independent of $p$. Moreover, both yield SSA at $p = 1$ and the equality conditions for $p \neq 1$ are identical to those for SSA. This independence of non-trivial equality conditions from the precise form of the function seems remarkable.

If one uses $\widetilde{g}_p$ and $\widetilde{J}_p(I,A,B)$ from (10), then the inequalities above hold with $p \in (1,2)$ replaced by $p \in (-1,0)$ and SSA corresponds to $p = 0$.



A Proof of the key Schwarz inequality 

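For completeness, we include the proof of the joint convexity of $(A,B,X) \mapsto \mathrm{Tr}\,X^*(L_A + tR_B)^{-1}(X)$ when $A, B > 0$ and $t > 0$. Since this function is homogeneous of degree one, it suffices to prove subadditivity. Now let

$$M_j = (L_{A_j} + tR_{B_j})^{-1/2}(X_j) - (L_{A_j} + tR_{B_j})^{1/2}(\Lambda). \tag{70}$$

Then one can verify that

$$0 \le \sum_j \mathrm{Tr}\,M_j^*M_j = \sum_j \langle M_j, M_j \rangle$$
$$= \sum_j \mathrm{Tr}\,X_j^*(L_{A_j} + tR_{B_j})^{-1}(X_j) - \mathrm{Tr}\,\Big(\sum_j X_j^*\Big)\Lambda - \mathrm{Tr}\,\Lambda^*\Big(\sum_j X_j\Big) + \mathrm{Tr}\,\Lambda^* \sum_j (L_{A_j} + tR_{B_j})\Lambda. \tag{71}$$

Next, observe that for any matrix $W$,

$$\sum_j (L_{A_j} + tR_{B_j})(W) = \sum_j (A_jW + tWB_j) = L_{\sum_j A_j}(W) + tR_{\sum_j B_j}(W).$$

Therefore, inserting the choice $\Lambda = (L_{\sum_j A_j} + tR_{\sum_j B_j})^{-1}(\sum_j X_j)$ in (71) yields

$$\mathrm{Tr}\,\Big(\sum_j X_j\Big)^* (L_{\sum_j A_j} + tR_{\sum_j B_j})^{-1}\Big(\sum_j X_j\Big) \le \sum_j \mathrm{Tr}\,X_j^*(L_{A_j} + tR_{B_j})^{-1}(X_j) \tag{72}$$

for any $t > 0$. QED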
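As a numerical sanity check of (72), one can compute $(L_A + tR_B)^{-1}(X)$ by solving the Sylvester-type equation $AY + tYB = X$ in the eigenbases of $A$ and $B$. The sketch below does this with plain NumPy; the function names (`inv_LR`, `q`) and the random test data are hypothetical.

```python
import numpy as np

def inv_LR(A, B, t, X):
    """Return Y = (L_A + t R_B)^{-1}(X), i.e. the solution of A Y + t Y B = X.

    With A = U diag(a) U* and B = V diag(b) V*, the equation becomes
    (a_i + t b_k) * Yt_ik = Xt_ik for Yt = U* Y V and Xt = U* X V.
    """
    a, U = np.linalg.eigh(A)
    b, V = np.linalg.eigh(B)
    Xt = U.conj().T @ X @ V
    Yt = Xt / (a[:, None] + t * b[None, :])
    return U @ Yt @ V.conj().T

def q(A, B, t, X):
    """The function Tr X* (L_A + t R_B)^{-1}(X) from the Appendix."""
    return np.trace(X.conj().T @ inv_LR(A, B, t, X)).real

def rand_pos(n, rng):
    G = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    return G @ G.conj().T + 0.1 * np.eye(n)

rng = np.random.default_rng(5)
n, m, t = 3, 4, 0.7
As = [rand_pos(n, rng) for _ in range(m)]
Bs = [rand_pos(n, rng) for _ in range(m)]
Xs = [rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n)) for _ in range(m)]

# Check that the solver really inverts L_A + t R_B
Y = inv_LR(As[0], Bs[0], t, Xs[0])
residual = np.linalg.norm(As[0] @ Y + t * Y @ Bs[0] - Xs[0])

# Subadditivity (72): right-hand side minus left-hand side should be nonnegative
gap = sum(q(A, B, t, X) for A, B, X in zip(As, Bs, Xs)) - q(sum(As), sum(Bs), t, sum(Xs))
print(residual, gap)
```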